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Metrology  with  Weak  Value  Amplification  and  Related  Topics:  Final  Report  -  Army  Research  Office,  Grant  No. 
W911NF-09-0-01417 


ABSTRACT 

This  report  summarizes  our  accomplishments  in  the  past  3+1  years  concerning  our  investigation  into  the  advantages  of  weak  value 
approaches  to  precision  measurements,  as  well  as  other  investigations  into  weak  measurements  and  weak  values.  The  report  is  divided  into 
two  sections:  The  first  focuses  on  a  detailed  discussion  of  results  concerning  applications  and  advantages  of  weak  value  amplification 
techniques  to  precision  measurement.  Here,  we  discuss  four  separate  but  connected  projects.  The  first  concerns  the  possibility  of  reusing 
the  rejected  photons  in  the  postselection  portion  of  the  weak  value  amplification.  We  examine  a  sequence  of  polarized  laser  pulses 
effectively  trapped  inside  an  interferometer  using  a  Pockels  cell  and  polarization  optics.  In  principle,  all  photons  can  be  post-selected,  which 
will  improve  the  measurement  sensitivity.  The  main  advantage  of  such  a  recycling  scheme  is  an  effective  power  increase,  while 
maintaining  an  amplified  deflection.  The  second  project  concerns  the  technical  merits  of  weak  value  amplification.  We  consider  models  of 
several  different  types  of  technical  noise  in  an  optical  context  and  show  that  weak  value  amplification  techniques  (which  only  use  a  small 
fraction  of  the  photons)  compare  favorably  with  standard  techniques  (which  uses  all  of  them).  Using  the  Fisher  information  metric,  we 
demonstrate  that  weak  value  techniques  give  much  higher  infonnation  over  conventional  methods  in  some  cases,  showing  a  fundamental 
advantage.  We  identify  other  cases  where  the  standard  technique  has  higher  Fisher  infonnation,  but  the  statistical  estimator  needed  to 
uncover  that  information  is  technically  difficult  to  implement,  demonstrating  technical  advantages.  We  also  discuss  a  third  project  using  our 
interferometric  weak  value  technique  to  make  precision  frequency  measurements  on  an  optical  beam,  reporting  a  resolution  of  129  +/-  7  kHz 
per  root  Hz  with  only  2  mW  of  continuous-wave  power.  The  fourth  project  deals  with  a  weak  value  inspired  phase  amplification  technique. 
We  monitor  the  relative  phase  between  two  paths  of  a  slightly  misaligned  interferometer  by  measuring  the  average  position  of  a 
split-Gaussian  mode  in  the  dark  port.  Although  we  monitor  only  the  dark  port,  we  show  that  the  signal  varies  linearly  with  phase  and  that  we 
can  obtain  similar  sensitivity  to  balanced  homodyne  detection.  We  relate  the  effect  to  an  inverse  weak  value. 

The  second  main  part  of  the  report  concerns  results  related  to  weak  measurements  and  weak  values,  but  not  directly  related  to  the 
amplification  issue.  We  give  a  brief  survey  of  our  results  in  this  area.  We  have  investigated  weak  measurements  as  a  test  of  quantum 
mechanics,  by  deriving  generalized  Leggett-Garg  inequalities  for  multiple  parties  and  violating  them  with  entangled  photons.  We  have 
made  a  systematic  analysis  of  coupled  electronic  Mach-Zehnder  interferometers,  where  one  is  used  as  a  detector  for  the  other;  this  set-up 
can  also  be  used  for  parameter  estimation.  We  have  extensively  investigated  a  new  approach  to  generalized  measurement  we  call  contextual 
values,  which  gives  a  precise  connection  between  an  indirectly  measured  system  and  the  outcomes  of  the  detector  by  assigning  a  set  of 
generalized  eigenvalues  to  each  detector  outcome.  This  assignment  permits  a  principled  definition  of  a  conditioned  average,  which  limits  to 
the  weak  value  under  certain  conditions  that  are  usually  satisfied  in  the  laboratory.  We  have  also  discussed  the  significance  of  the  imaginary 
part  of  the  weak  value.  While  the  imaginary  weak  value  is  very  useful  in  amplification  (parameter  estimation)  experiments,  we  show  it  has 
no  direct  connection  any  conditioned  average  of  the  operator  itself.  Often  the  weak  value  is  restricted  to  the  weak  limit,  where  the 
measurement  interaction  is  infinitesimally  small.  We  have  shown  that  the  general  structure  of  the  weak  value  is  actually  universal  for  any 
interaction  strength  and  any  meter  wavefiinction,  so  long  as  the  form  of  the  interaction  is  of  the  von  Neumann  type.  We  have  also 
investigated  the  concept  of  the  quantum  instrument,  and  how  it  directly  deals  with  observable  events  in  the  lab,  without  using  states  or 
observables. 
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Andrew  N.  Jordan  and  John  C.  Howell 
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This  report  summarizes  our  accomplishments  in  the  past  3+1  years  concerning  our  investigation 
into  the  advantages  of  weak  value  approaches  to  precision  measurements,  as  well  as  other  inves¬ 
tigations  into  weak  measurements  and  weak  values.  The  report  is  divided  into  two  sections:  The 
first  focuses  on  a  detailed  discussion  of  results  concerning  applications  and  advantages  of  weak  value 
amplification  techniques  to  precision  measurement.  Here,  we  discuss  four  separate  but  connected 
projects.  The  first  concerns  the  possibility  of  reusing  the  rejected  photons  in  the  postselection  por¬ 
tion  of  the  weak  value  amplification.  We  examine  a  sequence  of  polarized  laser  pulses  effectively 
trapped  inside  an  interferometer  using  a  Pockels  cell  and  polarization  optics.  In  principle,  all  pho¬ 
tons  can  be  post-selected,  which  will  improve  the  measurement  sensitivity.  The  main  advantage 
of  such  a  recycling  scheme  is  an  effective  power  increase,  while  maintaining  an  amplified  deflec¬ 
tion.  The  second  project  concerns  the  technical  merits  of  weak  value  amplification.  We  consider 
models  of  several  different  types  of  technical  noise  in  an  optical  context  and  show  that  weak  value 
amplification  techniques  (which  only  use  a  small  fraction  of  the  photons)  compare  favorably  with 
standard  techniques  (which  uses  all  of  them).  Using  the  Fisher  information  metric,  we  demonstrate 
that  weak  value  techniques  give  much  higher  information  over  conventional  methods  in  some  cases, 
showing  a  fundamental  advantage.  We  identify  other  cases  where  the  standard  technique  has  higher 
Fisher  information,  but  the  statistical  estimator  needed  to  uncover  that  information  is  technically 
difficult  to  implement,  demonstrating  technical  advantages.  We  also  discuss  a  third  project  using 
our  interferometric  weak  value  technique  to  make  precision  frequency  measurements  on  an  optical 
beam,  reporting  a  resolution  of  129  ±  7  kHz  per  root  Hz  with  only  2  mW  of  continuous-wave  power. 
The  fourth  project  deals  with  a  weak  value  inspired  phase  amplification  technique.  We  monitor  the 
relative  phase  between  two  paths  of  a  slightly  misaligned  interferometer  by  measuring  the  average 
position  of  a  split-Gaussian  mode  in  the  dark  port.  Although  we  monitor  only  the  dark  port,  we 
show  that  the  signal  varies  linearly  with  phase  and  that  we  can  obtain  similar  sensitivity  to  balanced 
homodyne  detection.  We  relate  the  effect  to  an  inverse  weak  value. 

The  second  main  part  of  the  report  concerns  results  related  to  weak  measurements  and  weak  val¬ 
ues,  but  not  directly  related  to  the  amplification  issue.  We  give  a  brief  survey  of  our  results  in  this 
area.  We  have  investigated  weak  measurements  as  a  test  of  quantum  mechanics,  by  deriving  gen¬ 
eralized  Leggett-Garg  inequalities  for  multiple  parties  and  violating  them  with  entangled  photons. 
We  have  made  a  systematic  analysis  of  coupled  electronic  Mach-Zehnder  interferometers,  where 
one  is  used  as  a  detector  for  the  other;  this  set-up  can  also  be  used  for  parameter  estimation.  We 
have  extensively  investigated  a  new  approach  to  generalized  measurement  we  call  contextual  values, 
which  gives  a  precise  connection  between  an  indirectly  measured  system  and  the  outcomes  of  the 
detector  by  assigning  a  set  of  generalized  eigenvalues  to  each  detector  outcome.  This  assignment 
permits  a  principled  definition  of  a  conditioned  average,  which  limits  to  the  weak  value  under  certain 
conditions  that  are  usually  satisfied  in  the  laboratory.  We  have  also  discussed  the  significance  of  the 
imaginary  part  of  the  weak  value.  While  the  imaginary  weak  value  is  very  useful  in  amplification 
(parameter  estimation)  experiments,  we  show  it  has  no  direct  connection  any  conditioned  average 
of  the  operator  itself.  Often  the  weak  value  is  restricted  to  the  weak  limit,  where  the  measurement 
interaction  is  infinitesimally  small.  We  have  shown  that  the  general  structure  of  the  weak  value  is 
actually  universal  for  any  interaction  strength  and  any  meter  wavefunction,  so  long  as  the  form  of 
the  interaction  is  of  the  von  Neumann  type.  We  have  also  investigated  the  concept  of  the  quantum 
instrument,  and  how  it  directly  deals  with  observable  events  in  the  lab,  without  using  states  or 
observables. 
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This  report  concerns  published  scientific  findings  made  by  the  Jordan  and  Howell  group  for  the  research  funded  by 
the  Army  Research  Office  for  the  year  2010-2013  (including  one  year  of  no-cost  extension).  The  main  theme  of  the 
proposal  is  the  use  of  weak  values  and  weak  measurements  for  the  purposes  of  precision  measurement,  in  particular 
parameter  estimation.  A  subsidiary  theme  is  investigation  into  fundamental  aspects  of  weak  measurements  and  weak 
values,  in  order  to  better  understand  their  meaning  and  applications.  On  the  first  theme,  we  summarize  our  findings 
of  four  projects  supported  by  this  grant:  The  use  of  photon  recycling  ideas  to  further  enhance  the  sensitivity  of 
this  technique,  a  detailed  analysis  of  various  technical  advantages  of  weak  value  amplification  techniques,  a  practical 
demonstration  of  precise  frequency  measurements,  and  a  second  experiment  making  precise  phase  measurements.  We 
go  to  briefly  summarize  our  supporting  theoretical  and  experimental  results.  These  include  utilizing  the  weak  value 
as  a  test  of  quantum  mechanics,  results  on  weak  measurements  and  values  in  condensed  matter  systems,  including  a 
recently  experimentally  realized  coupled  electronic  Maclr-Zehnder  interferometers  (MZI),  the  theoretical  development 
of  the  meaning  of  the  imaginary  weak  values  used  in  the  amplification  papers,  and  the  universal  form  of  a  conditioned 
average  once  away  from  the  weak  interaction  limit. 

A  long  standing  goal  in  optics  is  the  development  and  improvement  of  precision  optical  metrology.  In  the  first 
paper  on  weak  values  in  1988  [1],  Aharanov,  Albert  and  Vaidman  suggested  that  the  weak  value  effect  might  be  used 
as  an  amplifier  in  order  to  measure  (in  the  case  they  were  considering)  the  value  of  a  small  magnetic  field  by  looking 
at  the  anomalously  large  deflection  of  a  beam  of  atoms  traversing  a  Stern-Gerlach  apparatus.  The  general  validity 
of  this  weak  value  effect  was  later  shown  experimentally  in  an  optical  context  by  Ritchie  et  al.  [2],  who  replaced  the 
magnetic  spin  with  transverse  polarization,  and  Brunner  et  al.  [3] ,  who  illustrated  the  pervasiveness  of  the  weak  value 
effect  in  common  optical  telecom  networks. 

More  recently,  the  amplification  properties  of  this  weak  value  effect  have  been  exploited  in  similar  optical  systems 
to  precisely  measure  beam  deflection  [4-10],  frequency  shifts  [11],  phase  shifts  [12,  13],  time  delays  [14],  velocity 
measurements  [15],  and  even  temperature  shifts  [16],  by  using  either  polarization  or  which-path  degrees  of  freedom. 
In  most  of  these  experiments,  the  weak  value  amplification  (WVA)  technique  met  and  even  surpassed  the  sensitivity 
of  standard  techniques  in  the  field.  For  a  recent  review  of  this  and  related  weak  value  research,  see  Ref.  [17].  Although 
many  aspects  of  these  optical  experiments  can  be  described  using  classical  wave  optics  [18],  the  analysis  using  quantum 
techniques  provides  additional  insight  and  allows  for  future  extension  to  cases  with  no  classical  counterpart.  Hence, 
we  shall  continue  to  use  a  quantum  approach  in  this  work  as  well. 
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Part  I 

Weak  value  amplification:  recycling,  technical 
advantages,  and  frequency  measurements 

I.  INCREASING  PRECISION  OF  WEAK  VALUE  AMPLIFICATION  TECHNIQUES  WITH 

RECYCLED  PHOTONS 

The  first  work  in  this  report  [19]  concerns  a  theoretical  recycling  proposal  with  recycled  photons,  in  order  to  also 
make  use  of  the  photons  lost  through  the  bright  port  in  the  weak  value  scheme.  Consider  our  earlier  experiment  [5], 
where  the  tilt  of  a  moving  mirror  within  an  interferometer  is  detected  from  the  signal  on  a  split-detector.  While  this 
setup  has  a  sub-picoradian  resolution  with  only  milliwatts  of  laser  power,  there  are  a  number  of  ways  this  can  be 
improved  to  yield  even  greater  sensitivity. 

A  generic  shortcoming  of  weak-value-related  metrological  techniques  is  the  fact  that  only  a  small  fraction  of  the 
events  are  “post-selected” ,  while  the  vast  majority  of  events  are  intentionally  thrown  away.  The  main  goal  of  the 
current  work  is  to  investigate  how  this  situation  can  be  further  improved  if  those  events  are  recycled.  This  will  be  done 
by  taking  photons  which  are  not  post-selected  and  reinjecting  them  back  into  the  interferometer,  so  that  eventually, 
every  photon  can  be  post-selected  in  principle.  We  will  see  that  this  strategy  does  indeed  lead  to  an  improvement  in 
the  signal-to-noise  ratio  (SNR)  of  the  desired  parameter,  effectively  given  by  the  power  increase  of  the  split-detection 
signal.  Moreover,  since  the  existing  single-pass  weak-value  amplification  already  achieves  the  sensitivity  of  standard 
measurement  techniques  (such  as  homodyne  detection)  but  with  lower  technical  noise  [9],  the  improvements  from 
recycling  should  exceed  the  sensitivity  of  the  standard  techniques.  We  note  that  because  we  employ  not  just  a  single 
pass,  but  many  passes  of  a  given  photon  through  the  interferometer,  the  simple  weak  value  formula  used  in  the  first 
paper  on  the  subject  [5]  will  no  longer  suffice,  and  we  must  develop  a  theoretical  formalism  for  multiple  passes  that 
will  account  for  the  amplification  of  the  deflection,  as  well  as  the  probability  of  reaching  the  detector  after  some 
number  of  traversals. 

While  the  recycling  scheme  is  an  important  advance  in  its  own  right  (and  can  be  generically  applied  to  all  weak 
value  amplification  schemes),  it  also  lends  itself  to  further  enhancement  if  combined  with  other  precision  metrology 
techniques  currently  in  use.  For  example,  the  inclusion  of  a  spatial  filter  or  parity-flipping  element  to  Zeno-stabilize  the 
beam,  or  the  use  of  a  squeezed  reference  beam  [20-23]  could  significantly  reduce  degradation  effects  and  quantum  noise, 
respectively.  The  recycling  technique,  therefore,  sets  the  stage  for  combined  weak  value/quantum  light  amplification 
strategies  for  future  research.  Furthermore,  though  our  present  work  focuses  on  a  novel  pulsed  recycling  method, 
possible  extensions  to  continuous  wave  operation  may  allow  for  the  use  of  power  recycling  [24]  and  signal  recycling 
[25]  techniques,  both  of  which  are  in  use  in  modern  gravitational  wave  detectors  [26,  27]. 


A.  Qualitative  Arguments 

Our  baseline  for  comparison  will  be  the  Rochester  continuous  wave  (CW)  Sagnac  interferometric  scheme  described 
in  [5,  9,  18].  We  wish  to  improve  the  detected  SNR  by  using  a  combination  of  pulsed  laser  operation  with  the  same 
average  power  output  and  a  design  that  sends  the  undetected  portion  of  each  pulse  back  into  the  interferometer.  Such 
a  setup  is  illustrated  in  Figure  1  for  reference.  However,  before  committing  to  a  particular  recycling  design  we  can 
make  fairly  general  estimations  about  the  increases  in  sensitivity  that  we  expect  from  any  similar  recycling  scheme. 


1.  Characteristic  Time  Scales 

The  constraints  on  how  much  we  can  increase  the  power  collected  by  the  dark  port  detector  in  order  to  improve 
the  measurement  sensitivity  depend  crucially  on  the  relative  time  scales  involved,  which  include: 
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Split  Detector 


FIG.  1.  Simple  cyclic  weak  measurement  scheme.  A  laser  emits  a  pulse  of  horizontal  (H)  polarization  through  a  polarizing 
beam  splitter  (PBS),  which  travels  through  an  active  Pockels  cell  (PC)  that  rotates  the  polarization  to  vertical  (V),  after  which 
it  enters  a  Sagnac  interferometer  through  a  50:50  beam  splitter.  Inside  the  interferometer  the  combination  of  half-wave  plate 
(HWP)  and  Soleil-Babinet  compensator  (SBC)  rotates  the  pulse  polarization  back  to  H  with  a  relative  tunable  phase  shift  of  tj> 
between  the  clockwise  (O)  and  counter-clockwise  (O)  traveling  paths.  The  piezo-driven  mirror  imparts  a  transverse  momentum 
kick  k  that  differs  by  a  sign  for  the  0  and  O  paths.  A  split  detector  is  placed  at  the  dark  port  to  measure  a  resulting  pulse 
deflection.  The  H-polarized  part  of  the  pulse  that  exits  the  bright  port  is  rotated  again  by  the  active  PC  back  to  V  before 
being  confined  by  the  PBS  and  mirror  to  return  the  pulse  to  the  interferometer  through  the  PC,  now  inactive. 


Symbol 

Description 

Estimate 

T 

Laser  pulse  duration 

5  fs  -  1  ns 

T 

Laser  repetition  period 

1  ns  -  1  s 

Tr 

Traversal  period 

1  ns  -  10  ns 

Tg 

Gating  time 

2  ns 

Tp 

Minimum  inter-pulse  spacing 

T  +  Tg 

TABLE  I.  Relevant  time  scales  for  a  recycling  experiment. 


1.  The  pulse  duration  r  being  emitted  by  the  laser.  For  typical  lasers  this  can  vary  between  1  ns  and  5  fs  reasonably, 
which  correspond  to  pulse  lengths  of  0.3  nr  and  1.5  /rm,  respectively. 

2.  The  repetition  period  T  =  1//  of  the  laser.  For  typical  lasers  the  repetition  rate  /  can  vary  from  1  Hz  to  several 
GHz  reasonably. 

3.  The  traversal  period  Tr  of  the  interferometer  setup.  This  is  determined  by  the  physical  size  of  the  setup.  As  an 
upper-bound  estimate,  a  3  m  long  recycling  setup  will  have  a  total  period  of  10  ns. 

4.  The  gating  time  Tg  for  adding  new  pulses  to  the  interferometer.  This  will  determine  the  minimum  inter-pulse 
spacing  Tp  =  r  +  Tg  inside  the  interferometer.  This  also  must  be  strictly  less  than  the  time  between  pulses 
emitted  by  the  laser  Tg  <  T  —  t  so  that  every  new  laser  pulse  can  be  injected.  For  a  Pockels  cell,  Tg  ss  2  ns. 

These  are  summarized  in  Table  I. 

We  assume  in  what  follows  that  r  <Tp  <  Tr,  so  that  at  least  one  pulse  can  be  trapped  inside  the  interferometer. 
We  also  assume  that  the  average  power  output  P  of  the  pulsed  laser  is  equal  to  the  average  power  of  a  comparison 
CW  beam.  As  a  result,  the  average  power  of  each  individual  pulse  will  be  increased  by  a  factor  T/t  from  the  reference 
CW  beam. 


2.  Detector  SNR 

The  SNR  is  a  useful  indicator  for  the  sensitivity  of  the  measurement,  since  a  signal  producing  an  SNR  of  unity 
indicates  the  smallest  practically  resolvable  signal  [21].  The  detected  SNR  is  defined  as  the  ratio  of  the  collected  signal 
to  the  square  root  of  the  variance  of  that  collected  signal.  Our  raw  signal  ( S )  is  a  split-detection  of  the  transverse 
profile  of  the  beam,  which  measures  the  difference  in  photon  number  collected  by  each  side  of  the  detector,  thereby 
providing  information  about  the  horizontal  displacement  of  the  beam.  For  small  displacements,  the  variance  of  the 
split-detected  signal  is  well  approximated  by  the  second  moment,  which  is  in  turn  proportional  to  the  total  photon 
number  for  position-uncorrelated  photons  (see,  for  example,  Ref  [21]). 

The  total  accumulated  split-detected  signal  scales  linearly  with  the  average  collected  energy,  which  can  be  factored 
into  the  average  power  at  the  detector  Pd  multiplying  the  collection  duration  t.  The  variance  will  be  similarly 
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proportional  to  Pdt ,  so  the  SNR  will  scale  as  Pdt // Pdt  =  s/P/t.  Hence,  the  SNR  can  be  increased  either  by  waiting 
for  a  longer  duration  t  or  by  increasing  the  average  power  Pd  at  the  dark  port  detector.  Our  recycling  scheme  proposes 
to  increase  the  average  power  collected  within  a  fixed  duration  to  increase  the  sensitivity. 

In  the  original  interferometric  weak  value  scheme,  the  detector  collected  a  power  of  Pd  =  77P,  where  77  was  the 
post-selected  fraction  of  the  total  laser  power  P  coming  from  the  dark  port.  If  we  recycle  the  unused  light,  however, 
the  average  power  Pd  collected  at  the  detector  after  rx  recycling  passes  in  a  laser  repetition  period  T  will  have  the 
modified  form, 


Pd  =  - »?)"- V  =  (1  -  (1  -  v)rT)P, 

n—  1 


(i) 


where  77  is  the  fraction  of  the  input  power  that  exits  the  dark  port  of  the  interferometer  after  each  traversal,  and  rx 
is  the  number  of  recycled  pulses  that  hit  the  detector.  Here  we  have  ignored  optical  losses  and  detector  inefficiencies 
for  clarity.  The  power  collected  at  the  detector  after  a  single  traversal  is  ijP  and  the  SNR  scales  as  s/Pd,  so  the  net 
SNR  gain  factor  will  be, 


=  /1 — (1 — v)rT 

V  vp  V  ri 


(2) 


For  a  small  post-selection  probability — such  as  those  used  in  weak  value  experiments — then  we  can  expand  (2) 
around  r\  =  0  to  find, 


\^(l-(rT— 1  )D+0(V2). 


(3) 


For  r)(rT  —  1)  •C  1,  then  we  can  neglect  the  attenuation  of  the  pulse  to  see  an  approximate  /rx  SNR  scaling. 

For  sufficiently  large  r-r,  however,  the  factor  (2)  saturates  to  the  constant  value  1/1/77.  This  saturation  stems  from 
the  progressive  attenuation  of  the  recycled  pulse.  Furthermore,  the  smaller  the  post-selection  probability  gets,  the 
larger  we  can  make  the  possible  SNR  gain  over  a  single  pass.  In  this  limit,  however,  Pd  P  according  to  (1)  and  all 
the  photons  will  be  collected.  Note  that  despite  the  large  gain  in  power  (2)  at  the  detector,  the  best  SNR  that  one 
can  obtain  still  scales  according  to  the  standard  quantum  limit. 

The  measured  signal  at  the  detector  may  be  additionally  modified  by  geometric  and  propagation  effects,  which  we 
can  encapsulate  by  an  overall  factor  £(7 ’t)  that  depends  on  rx-  The  total  SNR  gain  factor  over  an  unrecycled  pulse 
will  then  be  prps/ Pd/r/P.  For  the  sake  of  comparison,  we  will  initially  ignore  these  effects  on  the  signal,  so  we 
will  approximate  £(rx)  ~  1  in  our  qualitative  arguments.  We  will  see  in  subsection  IB  that  for  a  collimated  beam 
£(rr)  will  approximate  unity  for  small  rx  but  will  eventually  converge  to  zero  for  large  rx-  Corrections  to  this  effect 
will  be  discussed  in  subsections  I B  2  d  and  I B  2  e,  where  we  will  see  that  one  can  maintain  a  measurable  signal  for  a 
collimated  beam  by  inverting  photons  about  the  optical  axis  on  each  traversal  or  Zeno-stabilizing  the  beam  with  a 
spatial  filter.  We  shall  also  see  in  subsection  IC  that  /rx)  can  exceed  unity  for  a  carefully  chosen  pulse  divergence, 
which  can  compensate  for  the  attenuation  effects  and  recover  the  approximate  /rx  scaling  for  a  much  wider  range  of 
rT. 


3.  Recycled  Pulse  Number 

We  can  compute  the  number  of  pulses  rx  that  hit  the  detector  per  laser  repetition  period  T  from  two  factors.  First, 
each  trapped  pulse  can  traverse  the  interferometer  roughly  r  <  T /Tr  times  each  repetition  period.  Each  traversal 
contributes  one  additional  pulse  impact  to  the  detector.  Second,  one  can  accumulate  a  maximum  of  p  <  Tr/Tp  = 
Tr/(r  +  Tg)  pulses  that  are  trapped  inside  the  interferometer.  Hence  the  total  number  of  detector  impacts  rx  =  pr 
per  period  T  will  be  bounded  by  T/(r  +  Tg).  Correspondingly,  the  maximum  SNR  gain  factor  (2)  that  we  can  expect 
from  power  considerations  will  also  be  bounded  entirely  by  the  relative  time  scales  and  the  post-selection  probability. 

In  practice,  not  every  recycled  pulse  will  contribute  constructively  to  the  SNR.  Indeed,  as  shown  in  (2)  and  as 
we  shall  see  in  subsection  IB,  there  will  be  some  maximum  number  rmax  of  constructive  pulse  impacts  before  the 
SNR  saturates  or  decays.  To  maximize  the  SNR  gain  in  such  a  case,  the  pulse  should  be  discarded  and  replaced 
by  a  fresh  pulse.  Hence,  the  number  of  practical  detector  collections  rx  will  be  less  than  the  maximum  estimation 
rx  —  pr  max  <  T/(t  +  Tg),  so  the  number  of  pulses  p  that  can  fit  inside  the  interferometer  will  become  important. 
Both  pulse  stabilization  techniques  and  diverging  lenses  can  increase  the  practical  range  of  ?’max,  which  we  will  discuss 
in  subsections  IB2d,  IB2e  and  I C. 
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4-  Practical  Estimates 

Using  the  Pockels  cell  as  a  gate,  we  expect  Tg  ss  2  ns.  Assuming  a  short  pulse  r  <C  Tg,  then  the  inter-pulse  spacing 
will  be  Tpc  ss  Tgc  =  0.6  m.  It  follows  that  the  maximum  number  of  pulses  inside  the  interferometer  will  be  p  ss  Tr/Tg. 
Assuming  a  large  3m  setup,  Tr  ss  10  ns,  so  p  «  10/2  =  5  will  be  a  generous  upper  bound  to  the  number  of  pulses 
that  we  can  expect  to  fit  inside  any  interferometer.  For  contrast,  the  smallest  setup  that  fits  only  a  single  pulse  will 
be  Tr  =  Tp,  or  Tpc  «  0.6  m  in  length. 

As  shown  in  subsection  I B  2  c,  without  loss  or  stabilization  we  can  expect  ?’max  <  80  to  be  an  optimistic  upper 
bound  for  a  constructive  number  of  recycling  passes.  The  maximum  number  of  pulse  impacts  tt  —  prmax  per  period 
T  that  we  expect  with  the  largest  setup  of  p  =  5  pulses  is  thus  tt  <  400.  Therefore,  we  can  expect  an  SNR  gain  to 
span  the  range  from  a  maximum  of  -\/400  =  20  over  a  single  pass  for  very  small  post-selection  probability  p  to  \Jl /p 
for  larger  p  according  to  (2).  Since  rmax  <  T/Tr  and  Tr  «  10  ns  for  the  3  m  setup,  the  laser  repetition  period  must 
be  T  ss  400  ns,  implying  a  repetition  rate  of  /  «  2.5  MHz.  For  contrast,  the  smallest  setup  of  0.6m  can  fit  only  p  =  1 
pulse,  so  tt  <  80.  The  SNR  gain  thus  ranges  from  a  maximum  of  V80  ss  9  over  a  single  pass  to  /p.  The  0.6  m 
setup  has  recycling  period  Tr  ss  2ns,  so  must  have  a  laser  repetition  period  T  <  160  ns,  or  rate  f  >  6.25  MHz.  These 
laser  specifications  should  be  readily  achievable  in  the  laboratory. 


B.  Analytic  Results 

1.  Sagnac  Interferometer 

Following  the  experiment  described  in  [5,  9,  18],  we  extend  the  schematic  to  pulsed  laser  operation  and  pulse  recy¬ 
cling.  As  shown  in  Fig.  1,  the  addition  of  a  Pockels  cell  (PC)  and  polarizing  beam  splitter  (PBS)  allows  the  unused 
portion  of  each  pulse  that  exits  the  bright  port  of  the  Sagnac  interferometer  to  be  redirected  back  inside  the  interfer¬ 
ometer  to  complete  multiple  traversals.  The  combination  of  half-wave  plate  (HWP)  and  Soleil-Babinet  compensator 
(SBC)  provide  a  tunable  relative  phase  c\>  between  the  clockwise  (0)  and  counter-clockwise  (O)  propagating  paths 
of  the  interferometer,  but  also  flips  the  net  polarization  of  each  pulse.  As  a  result,  the  PC  must  be  active  as  each 
pulse  initially  enters  the  bright  port  and  when  each  pulse  exits  the  bright  port  again;  however,  it  must  be  inactive  as 
each  pulse  returns  to  the  bright  port  after  being  confined  by  the  PBS  and  mirror.  By  injecting  new  pulses  exactly 
when  older  pulses  exit  the  bright  port,  one  can  minimize  the  inter-pulse  spacing  inside  the  interferometer  to  roughly 
a  single  gating  time. 

We  also  briefly  note  that  the  HWP  and  SBC  can  be  removed  in  favor  of  a  vertical  tilt  to  provide  the  relative  phase 
< f> .  With  this  variation,  the  PC  turns  on  and  off  only  once  per  repetition  period  in  order  to  inject  a  new  pulse  into  the 
interferometer,  as  opposed  to  cycling  for  every  pulse  traversal.  This  variation  does  not  change  the  inter-pulse  spacing 
inside  the  interferometer,  however,  so  it  provides  no  SNR  benefits,  though  it  does  provide  a  technical  advantage  due 
to  the  minimized  number  of  PC  cycles  per  laser  repetition  period  T. 


2.  Pulse  Recycling 

Because  there  is  no  important  interaction  between  distinct  pulses  in  the  recycling  scheme,  the  SNR  gains  are 
fundamentally  determined  by  the  effects  of  single  pulse  recycling.  Therefore,  we  shall  consider  in  some  detail  what 
happens  to  a  single  pulse  profile  after  r  passes  through  the  interferometer  under  the  assumption  that  the  pulse  remains 
collimated.  We  will  relax  the  collimation  assumption  numerically  in  subsection  IC. 

a.  Pulse  States.  Assume  the  clockwise-propagating  state  of  the  pulse  in  the  Sagnac  interferometer  is  denoted  |  O) 
and  the  counter-clockwise-propagating  state  is  denoted  |  0).  Then  the  state  that  enters  the  interferometer  through 
the  50:50  beam  splitter  will  have  the  form,  | ip+)  =  -U  (|  0)  +  j|  0)).  This  will  also  be  the  post-selection  state  for  the 
bright  port  of  the  interferometer.  Similarly,  the  post-selection  state  for  the  dark  port  of  the  interferometer  will  have 
the  orthogonal  form,  ]■)/>_)  =  -U  (|  0)  —  i|  0)).  We  also  define  the  which-path  operator  as  W  =  |  O)(0  |  —  |  0}(0  |. 

Assume  the  initial  transverse  pulse  profile  is  given  by  a  state  | ip).  We  normalize  the  state  of  the  transverse  pulse 
profile  so  that  its  squared  norm  will  encode  the  average  photodetection  rate.  Hence,  measuring  a  pulse  with  a  detector 
for  the  pulse  duration  r  will  produce  N  =  t||<^||2  photon  impacts  upon  the  detector  per  pulse  on  average.  This  choice 
of  normalization  will  allow  simple  computation  of  the  SNR  without  multi-particle  Fock  space  calculations  (e.g.,  as 
used  in  [21]). 

The  total  initial  pulse  state  that  enters  the  interferometer  will  have  the  product  form,  |Tq)  =  \if>+)\(p).  For  simplicity 
we  suppress  the  polarization  of  the  state  and  any  propagation  effects. 
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FIG.  2.  Strongly  misaligned  regime  with  <j>  <  1  <  ka.  Left:  the  transverse  pulse  profile  nr(x)  that  impacts  the  dark  port 
detector  on  the  first  (blue,  solid)  and  second  (red,  dashed)  traversals.  Right:  the  accumulated  transverse  pulse  profile  nr(x ) 
on  the  dark  port  detector  after  a  single  (blue,  solid)  and  ten  (red,  dashed)  traversals.  For  this  regime  the  interference  pattern 
covers  the  entire  profile,  subsequent  pulses  are  strongly  attentuated,  and  the  interference  of  the  accumulated  profile  is  slowly 
filled  in. 


The  traversal  through  the  interferometer  performs  three  operations  on  the  state.  The  first  is  the  passage  through 
the  SBC  and  HWP,  which  creates  a  relative  phase  shift  <j>  between  the  paths  that  can  be  described  by  the  unitary 
operator  Usbc  =  elr^w^2.  The  second  is  the  tilting  piezo  mirror,  which  imparts  a  transverse  momentum  kick  k  to  the 
pulse,  described  by  the  unitary  operator  Up  =  e~lkWx  i  where  K  =  1  and  the  transverse  position  operator  x  generates 
a  momentum  translation  k.  The  third  is  a  generic  uniform  loss  with  probability  7,  described  by  a  nonunitary  loss 
operator,  L  =  y/1  —  7I. 

The  state  of  the  pulse  profile  as  it  arrives  back  at  the  50:50  beam  splitter  after  one  traversal  has  the  form,  |'ki)  = 
iUpUsBcl^o)-  After  the  pulse  traverses  the  50:50  beam  splitter,  it  splits  into  two  paths  once  more.  The  dark  port 
projects  the  photon  onto  the  | V’-)  state,  and  the  bright  port  projects  the  photon  onto  the  \i/j+}  state.  Hence,  we  obtain 

the  following  two  states  in  the  bright  and  dark  ports,  respectively,  |\I/±)  =  \ip±)  where  we  have  factored 

out  the  measurement  operators  M±  =  (V7t|TUpUsBcl^+)  that  affect  the  transverse  profile  of  the  pulse  in  each  case. 
Written  out  explicitly,  these  measurement  operators  are  diagonal  in  the  position  basis  and  have  a  remarkably  simple 
form, 


M+  =  y/l^  7  cos  ((j)/2  —  kx ) , 
M  =  i \J\  —  7  sin  (</>/ 2  —  kx) , 


(4a) 

(4b) 


where  we  have  used  ( il}±\Wn\ip+ )  =  (1  ±  (— l)n)/2. 

b.  Number  Densities.  Using  the  measurement  operators  (4),  the  exact  pulse  state  that  exits  the  dark  port  after 
r  traversals  through  the  interferometer  will  be  I’I'Il)  =  \>p-)  ^A^-(A^+)r_1|v5))  ■  Therefore,  the  number  density  nr{x) 
of  photons  that  hit  the  dark  port  detector  at  a  transverse  position  x  on  the  rth  pulse  traversal  is, 

nr(x)  =  r|(a:|4'!))|2  =  n0{x)(  1  -  j)rx  (5) 

—  kx^j  cos2(r_1) 

where  no(x)  =  t|(:t|7>)|2  is  the  number  density  for  the  input  pulse. 

The  total  number  density  nr(x)  that  accumulates  on  the  dark  port  detector  after  r  traversals  of  the  pulse  will  be 
the  sum  of  the  number  densities  for  the  r  traversals, 


r(x)  =^2nj(x) 

3= 1 

(1  -  7)  (1  -  [(1  -  7)  cos2  {(/)/ 2  -  kx)] ' ) 


=  no(x)~ 


1+7  cot2  ((/)/ 2  —  kx) 


n . 


(6) 


Hence  the  total  number  of  photons  that  hit  the  detector  after  r  traversals  is  Nr  =  fdxnr(x).  Furthermore,  if  we 
compare  (6)  to  the  heuristically  estimated  detector  power  (1)  when  7  — >■  0,  we  see  that  the  spatially  resolved  version 
of  the  post-selection  probability  is  given  by  rj  sin2(</>/2  —  kx). 

In  the  limit  of  an  infinite  number  of  trials  r  — >  00,  the  final  term  in  (6)  vanishes  and  we  are  left  with  the  number 
density, 


rioo{x)  =  n0  (x) 


1-7 


1  +  7cot“  (</>/ 2  —  kx) 


(7) 


For  no  loss,  7  — >  0,  the  modulating  factor  from  the  measurement  cancels  and  the  original  pulse  is  completely  recovered , 
which  is  surprising  since  for  the  first  pass  there  is  an  anomalously  large  position  shift.  This  means  that  if  all  the  photons 
in  a  perfectly  collimated  pulse  are  collected  through  repeated  recycling,  then  the  information  about  the  measurement 
will  be  erased  due  to  a  progressive  smearing — or  walk-off — effect  of  the  interference  pattern,  as  illustrated  in  Figs.  2, 
3,  and  4.  Such  a  result  indicates  that  a  collimated  pulse  should  be  thrown  away  or  reshaped  after  a  finite  number 
of  traversals  in  order  to  maximize  the  information  collected  at  the  detector  regarding  the  momentum  kick  k  and  the 
induced  phase  shift  <f>. 
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FIG.  3.  Weak-value  regime  with  ka  <  (j>  <  1,  with  parameters  chosen  to  exaggerate  the  walk-off  effect.  Left:  the  transverse 
single  lobed  pulse  profile  nr{x)  that  impacts  the  dark  port  detector  on  the  first  four  traversals  in  order  of  (blue,  solid),  (red, 
dashed),  (purple,  dot-dashed),  and  (brown,  dotted).  Right:  the  accumulated  transverse  pulse  profile  nr{x )  on  the  dark  port 
detector  after  the  first  four  traversals,  with  the  same  color  coding.  For  this  regime,  the  dark  port  profile  resembles  a  single  shifted 
Gaussian  that  gradually  walks  back  toward  the  center  on  multiple  traversals  with  some  attenuation,  eventually  recovering  the 
original  profile.  For  more  realistic  parameters,  such  as  those  used  in  [5],  the  walk-off  effect  is  smaller  per  traversal. 

c.  Gaussian  Pulse.  To  gain  some  intuition  about  the  collected  number  density  (6),  consider  an  initial  pulse  with 
a  zero-mean  Gaussian  transverse  profile. 


n0(x) 


N 
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In  what  follows,  we  will  consider  three  specific  parameter  regimes  for  the  Gaussian  pulse: 


1.  the  strongly  misaligned  regime  <f>  <  1  <  ka 


(8) 


2.  the  weak-value  regime  ka  <  (f>  <  1 

3.  the  inverse  weak-value  regime  (j>  <  ka  <  1 

These  regimes  are  illustrated  in  Figures  2,  3,  and  4,  respectively. 

In  the  strongly  misaligned  regime  (f>  <  1  <  ka ,  the  profile  that  exits  the  dark  port  on  each  traversal  (5)  is  shown 
in  Fig.  2.  The  interference  pattern  covers  the  entire  beam  profile.  On  the  first  pass,  the  intensity  of  the  peaks  match 
the  maximum  intensity  of  the  beam.  Subsequent  passes  are  strongly  attenuated  due  to  the  small  overlap  with  the 
complementary  interference  pattern  in  the  beam  that  remains  inside  the  interferometer.  The  accumulated  profile 
nr(x)  in  (6)  steadily  shrinks  the  width  of  the  interference  dips  with  increasing  traversal  number  until  the  entire  beam 
profile  is  recovered.  The  strongly  misaligned  regime  is  unlikely  to  be  useful  in  a  precision  measurement  due  to  the 
large  value  of  k;  we  have  included  it  in  our  discussion  for  completeness  and  to  emphasize  that  the  single  and  double 
lobes  that  appear  in  the  other  regimes  are  not  simple  beam  shifts,  but  appear  from  an  interference  effect. 


9 


nr(x) 


n,.(x) 


FIG.  4.  Inverse  weak-value  regime  with  <j>  <  ka  <  1,  with  parameters  chosen  to  exaggerate  the  walk-off  effect.  Left:  the 
transverse  double-lobed  pulse  profile  nr(x)  that  impacts  the  dark  port  detector  on  the  first  four  traversals  in  order  of  (blue, 
solid),  (red,  dashed),  (purple,  dot-dashed),  and  (brown,  dotted).  Right:  the  accumulated  transverse  pulse  profile  nr( x)  on  the 
dark  port  detector  after  the  first  four  traversals,  with  the  same  color  coding.  For  this  regime,  there  are  two  lobes  that  very 
gradually  walk  back  toward  the  center  on  multiple  traversals  with  some  attenuation  to  eventually  recover  the  original  beam 
profile.  For  more  realistic  parameters,  such  as  those  used  in  [12,  13],  the  walk-off  effect  is  smaller  per  traversal. 


For  ka  <  <f>  <  1,  we  obtain  the  weak-value  parameter  regime  considered  in  [5].  The  interference  pattern  in  the 
number  density  nr(x)  indicated  in  (5)  leaves  a  single  displaced  peak  that  resembles  a  shifted  Gaussian  that  is  shown 
in  Fig.  3.  Subsequent  traversals  have  similar  intensities,  but  progressively  walk  toward  the  center  with  increasing  r. 
The  amplified  signal  comes  from  the  anomalously  large  shift,  so  this  walk-off  degrades  the  amplification  properties  of 
the  setup  with  increasing  r. 

The  walk-off  effect  arises  because  the  beam  that  remains  inside  the  interferometer  has  had  a  small  fraction  of  light 
removed  by  the  post-selection  from  one  side,  which  causes  a  complementary  displacement  in  the  opposing  direction. 
This  complementary  shift  counter-acts  the  dark-port  displacement  on  subsequent  traversals,  which  makes  the  output 
walk  back  toward  the  center  of  the  original  profile.  Hence,  after  r  traversals  the  accumulated  weak-value  signal  nr{x) 
(6)  will  resemble  r  times  the  intensity  of  a  single  traversal,  but  will  also  be  degraded  due  to  the  walk-off  effect.  The 
walk-off  is  shown  exaggerated  in  Fig.  3,  but  is  a  smaller  effect  per  traversal  with  more  realistic  parameters,  such  as 
those  in  [5].  However,  even  though  the  effect  per  traversal  is  smaller,  for  a  sufficiently  large  number  of  traversals  the 
signal  will  always  be  completely  erased  by  this  walk-off  effect  according  to  (7). 

For  <j)  <  ka  <  1,  we  enter  the  inverse  weak-value  regime  considered  in  [12,  13]  and  originally  observed  in  [2].  The 
weak-value  assumptions  that  produce  the  single  peak  break  down  and  (5)  produces  the  double-lobed  profile  shown  in 
Fig.  4.  On  multiple  traversals  the  peaks  gradually  walk  back  toward  the  center,  similarly  to  the  weak-value  regime. 
However,  the  forced  zero  in  the  center  will  stabilize  the  profile,  so  that  after  r  traversals  the  accumulated  profile  nr( x) 
(6)  will  more  closely  resemble  r  times  the  intensity  of  a  single  traversal  than  in  the  weak-value  regime.  The  walk-off 
is  shown  exaggerated  in  Fig.  4,  but  is  also  a  smaller  effect  per  traversal  with  more  realistic  parameters,  such  as  those 
used  in  [12,  13]. 

These  different  regimes  for  weak  value  amplification  measurements  are  also  carefully  explored  in  the  recent  review 
paper  [28]. 

d.  Parity  Flips.  A  simple  technique  for  compensating  for  the  profile  erosion  on  multiple  traversals  is  to  invert 
the  profile  around  the  x  =  0  line  so  that  each  new  traversal  partially  cancels  the  walk-off  from  the  previous  traversal. 
This  can  be  accomplished  by  introducing  a  parity-flipping  optic  represented  by  a  parity  operator  Px  that  modifies 
the  profile  by  replacing  x  — ►  —x.  This  results  in  a  net  replacement  of  the  operator  M+  — >  PXM+  in  (4).  After  an 
even  number  of  traversals  2 r,  the  accumulated  number  density  (6)  then  has  the  modified  form, 


ri2r(x)  =  no(x)  sin2  (  A  _  kx  J  (  1  +  cos2  (  A  —  kx 


1  - 

cos2  | 

t  -  kx ) 

cos2  | 

^  +  kx ^ 

r 

1  —  cos2 

(2  -  kx 

I  cos2 

(2  +kx 

(9) 
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FIG.  5.  Normalized  split-detection  response  for  a  collimated  pulse.  Left:  response  for  the  weak-value  regime  as  a  function  of 
ka  with  fixed  <f>,  where  ka  <  (j>  <  1,  and  with  parameters  consistent  with  Ref.  [5].  The  accumulated  signal  is  shown  for  traversal 
numbers  r  =  1  (blue,  solid),  r  =  5  (red,  dashed),  r  =  10  (purple,  dot-dashed),  and  r  =  20  (brown,  dotted).  Right:  response 
for  the  inverse  weak-value  regime  as  a  function  of  cj>/n  with  fixed  k,  where  (j>  <  ka  <  1,  and  with  parameters  consistent  with 
Ref.  [12,  13].  The  accumulated  signal  is  shown  for  the  same  traversal  numbers  and  color  coding.  Though  the  walk-off  effects  at 
large  traversal  numbers  change  the  slope  in  both  regimes,  the  linear  response  is  preserved.  Hence,  one  can  calibrate  the  slope 
through  repeated  experiments  with  a  fixed  number  of  traversals  per  laser  period.  The  slope  is  negative  here  since  the  signal  is 
negative  in  Eq.  (20). 
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FIG.  6.  Normalized  split-detection  response  for  a  collimated  pulse  stabilized  by  parity  flipping,  using  the  same  color  coding  as 
in  Figure  5.  Left:  response  for  the  weak-value  regime  as  a  function  of  ka  for  fixed  <j>,  where  ka  <  <j>  <  1.  All  traversal  numbers  r 
have  the  same  linear  response  due  to  the  parity-flip  stabilization;  Zeno  stabilization  produces  the  same  result.  Right:  response 
for  the  inverse  weak-value  regime  as  a  function  of  <f>/ir  for  fixed  k,  where  (j>  <  ka  <  1.  The  linear  response  acquires  a  steeper 
slope  for  larger  traversal  number  using  parity-flip  stabilization;  Zeno  stabilization,  however,  would  result  in  identical  slopes  for 
any  r. 


where  we  have  set  7  — >■  0  for  clarity.  Unlike  (6)  where  we  do  not  flip  the  output  of  the  bright  port  on  each  traversal, 
this  expression  does  not  yield  the  original  input  profile  in  the  limit  of  large  r.  Instead  it  yields, 


noo(x)  =  n0{x) 


sin2  —  kx'j  ^1  +  cos2  —  kx ^ 

1  —  cos2  —  kx^  cos2  +  kx ^ 


(10) 


which  maintains  a  signal,  in  contrast  with  the  case  of  no  parity  flips  on  a  collimated  beam. 

e.  Zeno  stabilization.  Another  way  to  reduce  transverse  walk-off  effects  and  thereby  restore  the  SNR  to  the 
power-limited  scaling  of  (2)  in  subsection  I A  2  is  to  utilize  the  physics  of  the  quantum  Zeno  effect  by  using  an  optical 
filter  to  project  the  transverse  profile  back  into  its  original  state.  The  advantage  of  the  Zeno  stabilization  over  parity 
flipping  is  that  the  former  does  not  swap  the  transverse  locations  of  the  photons;  this  may  be  important  when  using 
quantum  states  of  light,  e.g.,  squeezed  or  entangled  states,  whose  benefits  rely  on  maintaining  transverse  correlations 
between  the  photons. 
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At  every  round  of  the  recycling  with  Zeno  stabilization,  the  beam  is  passed  through  a  spatial  filter,  so  if  the  beam 
is  in  its  original  profile,  it  will  pass  through  the  filter  perfectly;  however,  if  the  waveform  is  distorted,  then  a  photon 
in  that  mode  will  have  some  probability  to  be  absorbed.  In  passing  through  the  filter  on  each  traversal,  a  photon  in 
this  mode  will  only  experience  a  small  disturbance  to  the  transverse  profile,  and  the  state  will  tend  to  “freeze”  in  its 
original  state  with  only  a  small  rate  of  being  projected  into  an  orthogonal  state  (in  this  case,  being  absorbed  by  the 
filter).  We  note  this  technique  will  work  regardless  of  the  nature  of  the  disturbance,  provided  it  is  small  in  each  pass. 

To  see  how  this  works,  let  us  consider  the  Gaussian  transverse  state  in  (8)  for  a  single  photon  (N  =  1).  The 
corresponding  transverse  spatial  state  for  this  photon  has  the  form 

Mx)  =  <*#0)  =  (2J2)1/4  (-^)  ■  (11) 

After  one  traversal  through  the  interferometer,  the  state  emerging  from  the  bright  port  according  to  (4)  is 

\<t>i)  =  M+|<£o)  =  \/l  -  7cos(^>/2  -  kx)\<t>0).  (12) 

To  compute  the  reshaping  probability,  we  renormalize  this  state  by  dividing  out  its  norm 

(</>i|<£i)  =  (1  -  7)  J dx\4>o{x)\2  cos2((j)/2  —  kx)  (13) 

=  (1  -  q)(l  +  e-2fc2<T“  cos<£)/2, 
to  produce  the  normalized  state  \4>i,n)  =  |</>i) / \J (<^i|<^i)- 

If  we  now  make  a  projective  measurement  with  a  spatial  filter  of  the  shape  |(a;|</>o)|2,  the  photon  will  be  restored  to 
the  state  |<fo)  with  a  probability  P\  =  \(4>o\^i,n)\2  ■  The  probability  can  be  calculated  from 

(</Wi ,n)  =  J dx\(f>o(x)\2  cos(<£/2  -  kx),  (14) 

v/2e-fc2°'2/2  cos(<^/2) 

\/l  +  e~2k2°2  cos  (j) 

We  are  interested  in  the  case  where  both  <t>  and  ka  are  less  than  1.  Consequently,  we  can  expand  Pi  to  leading  order 
in  ka  and  <f>, 

Pi  =  1  -  (ka)4/ 2  -  ( ka)2(t>2/A  +  ...,  (15) 

where  we  drop  terms  of  higher  order  in  powers  of  (ka)2  and  (f)2 .  In  the  weak-value  regime  where  ka  <  (f>  <  1,  the 
second  term  in  (15)  may  be  dropped.  In  the  inverse  weak-value  regime  where  <j>  <  ka  <  1,  the  third  term  in  (15)  may 
be  dropped.  In  either  case,  for  repeated  cycles  consisting  of  M  independent  measurements  the  probability  Pm  =  P\ 1 
of  being  projected  back  into  state  (x|(/>o)  will  decay  approximately  exponentially  as 

Pm  =exp[— MT],  (16) 

where  T  «  (ka)4/2  +  (ka)2 </>2 / 4  is  an  effective  decay  rate.  We  can  therefore  make  M  ~  Mz  =  1/T  measurements 
before  a  photon  is  typically  absorbed  by  the  reshaping  filter.  This  is  the  manifestation  of  the  Zeno  effect,  where  by 
making  repeated  projections,  the  state  is  kept  in  its  initial  state  for  much  longer  than  would  happen  otherwise. 

This  Zeno  number  Mz  is  many  more  cycles  that  we  will  be  able  to  make  before  the  detector  measures  all  the 
photons  exiting  the  dark  port.  For  example,  if  we  chose  the  exaggerated  values  ka  =  0.1  and  <f>  =  7r/8  as  in  Figure  3 
then  this  gives  a  Zeno  number  of  Mz  ~  2.3  x  103 ,  which  is  still  an  order  of  magnitude  larger  than  we  require  for  the 
detection  physics. 


3.  Split- detected  signal 


In  order  to  measure  the  transverse  momentum  kick  k  or  the  phase  shift  <f>  we  compare  the  sides  of  the  transverse 
profile  using  a  split  detector.  As  outlined  in  subsection  I A  2,  the  accumulated  split-detected  signal  after  r  pulse 
repetitions  of  time  duration  r  is  given  by  the  difference  of  the  number  densities  on  each  side, 


da;  nr(x) 


da;  nr(  x). 


(17) 
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FIG.  7.  SNR  gain  versus  traversal  number  r  for  a  collimated  pulse.  Left:  the  SNR  gain  for  the  weak-value  regime  ka  <  <j>  <  1 
with  ka  =  10~4  and  <j>  =  tv/16.  The  uncorrected  beam  with  walkoff  (blue,  solid)  shows  clear  degradation  with  traversal  number, 
while  the  beam  corrected  with  parity  flipping  (red,  dashed)  as  in  Sec.  I B  2  d  matches  the  simple  power  scaling  law  exactly  (black, 
dashed)  from  Eq.  (2).  Note  that  Zeno  stabilization  will  identically  produce  this  power-scaled  SNR  by  construction.  Right:  the 
SNR  gain  for  the  inverse  weak-value  regime  <j>  <  ka  <  1  with  ka  =  0.1  and  <f>  =  n/256.  The  uncorrected  double-lobed  beam 
with  walkoff  (blue,  solid)  shows  somewhat  less  degradation  than  the  weak-value  regime  due  to  the  forced  zero  in  the  profile; 
however,  the  double-lobed  beam  corrected  with  parity  flipping  (red,  dashed)  manages  to  exceed  the  simple  power  scaling  law 
(black,  dashed),  and  thus  the  scheme  with  Zeno  stabilization. 


To  measure  the  displacement  of  the  pulse,  the  signal  should  be  subsequently  normalized  by  the  total  photon  number 
( S)r/Nr  in  order  to  extract  the  averaged  behavior. 

For  small  displacements  the  variance  of  the  raw  split  detected  signal  is  the  second  moment  to  a  good  approximation, 


(A S)l  »  (S2)r  =  J d xnr(x)  =  Nr, 


(18) 


which  is  the  total  number  of  photons  that  have  impacted  at  the  dark  port  detector.  Hence,  the  SNR  that  has 
accumulated  after  the  rth  traversal  will  be  given  by, 


SNRr 


(S)r 

(A  S)r 


(S)r 

VN~r' 


(19) 


For  the  zero-mean  Gaussian  (8)  these  quantities  can  be  computed  exactly  for  the  first  traversal, 


iVi 

(S)i 
(S) 1 

Ni 

SNRi 


(1  -  7 )N 


(l  _  g-(2 ka) 


72 


COS 


(1  -  7 )N  _ 
2  6 

— Erfi(v/2fc<r) 

-^(1-7  )N 


-(2fcCT)2/2Erfi(V2fccr)  sin  (f>, 

e-(2ka)2 /2  gjn^ 

1  —  e-(2fccr)2/2  cos  cj)1 
Erfi('\/2fccr)e_^2fcfT^2/2  sin  <j> 
V/ 2(1  -  e— (2fecr)2/2  cos  <j>) 


(20a) 

(20b) 

(20c) 

(20d) 


where  Erfi(a:)  =  Erf (ix)/i  =  (2/y7r)  fje*  dt  is  the  imaginary  error  function.  We  now  specialize  these  exact  solutions 
to  the  two  amplification  regimes  under  consideration  and  indicate  numerically  how  larger  traversal  numbers  behave 
in  each  regime. 

a.  Weak-value  regime.  When  ka  <  </>  <  1  then  we  can  neglect  terms  of  order  (ka)2  in  (20)  to  find, 


(1  -  7)  A  sin2(</>/2), 


(1  —  "f)Nka  sincj), 


(21a) 

(21b) 
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—  y  —  2ka  cot(<(>/2), 

—  y^V (1  -  7)1V (2ka  cos(0/2)) 

—  \j^\fN[(2ka  cot{(j>/2)). 


(21c) 

(21d) 


These  linear  order  solutions  correctly  match  the  weak  value  analyses  made  in  [5,  9,  18],  as  expected.  Due  to  the 
factor  cot(</>/2)  in  the  normalized  split  detection  (S)i/Ni,  setting  a  known  small  (j)  provides  an  amplification  factor 
for  measuring  an  unknown  small  k.  This  regime  gets  its  name  from  the  fact  that  this  amplification  factor  is  the 
imaginary  part  of  the  weak  value  Ww  =  (^_|W'|V’0)/(V’-IVV>)  =  icot(</>/2)  of  the  which-path  operator  W  with  initial 
state  | ip^)  =  [/sbc|'0+)  and  post-selection  state  ]'*/’-)•  The  normalized  signal  for  this  parameter  regime  is  shown  in 
the  left  plot  of  Fig.  5  as  a  function  of  ka,  demonstrating  the  linear  response. 

We  can  reproduce  the  dominant  SNR  gain  factor  for  small  post-selection  probability  by  neglecting  the  walk-off 
effects  and  the  power  attenuation.  To  do  this,  we  expand  the  accumulate  profile  nr(x)  in  (6)  to  first  order  in  kc r  and 
second  order  in  </>  to  obtain, 


nr{x) 


rc{^,r)n0(x) 


c(y,r) 


(i  -7)(i  -  (i-7r) 


(22) 

(23) 


where  lining  0(7,  r)  =  1. 

The  only  r-dependence  in  the  number  density  is  in  the  numeric  prefactor  rc( 7,  r) ,  which  effectively  scales  the  total 
photon  number  N  -7  rc{^,r)N.  Using  this  scaling,  the  result  (21)  for  the  split-detector  will  hold  for  any  r  to  second 
order  in  <f>  and  first  order  in  k.  Hence,  the  SNR  should  scale  as  y/7Vrc( 7,  r)  when  walk-off  and  power  attenuation 
effects  are  neglected.  When  7  — >  0,  this  recovers  the  dominant  y[r  SNR  enhancement  factor  that  we  found  to  zeroth 
order  in  the  post-selection  probability  of  (2)  in  subsection  I A  2  from  power  considerations. 

However,  the  walk-off  effects  and  power  attenuation  combine  to  reduce  the  actual  SNR  below  this  optimistic  level. 
To  see  this,  consider  the  solid  blue  curve  in  the  left  plot  of  Fig.  7,  which  shows  the  split-detected  SNR  gain  versus 
traversal  number  for  the  weak-value  regime.  The  SNR  gain  for  any  sufficiently  small  ka  is  universal,  but  plateaus 
quickly  due  to  the  beam  degradation  from  the  walk-off.  Even  worse,  for  sufficiently  large  traversal  number  r  the  signal 
will  eventually  decline  and  then  converge  to  zero  due  to  the  erasure  effect  implied  by  (7),  so  the  SNR  gain  factor  will 
also  correspondingly  decay  to  zero. 

The  power  scaling  in  (2)  can  be  recovered,  however,  if  the  walk-off  is  corrected  with  the  parity  flipping  method 
discussed  in  subsection  I B  2  d.  The  normalized  signal  produced  with  the  parity-flip  correction  illustrated  in  the  left 
plot  of  Fig.  6 — has  an  identical  slope  for  any  traversal  number,  demonstrating  the  simple  power  scaling  behavior. 
This  correction  is  illustrated  as  the  dot-dashed  red  curve  in  the  left  plot  of  Fig.  7,  which  exactly  overlaps  the  power 
scaling  curve  illustrated  as  the  dashed  black  curve.  If  the  walk-off  is  corrected  with  Zeno  stabilization  instead,  then 
the  signal  slope  will  be  identical  for  any  traversal  number  by  construction,  and  the  SNR  gain  will  also  exactly  follow 
the  power  scaling  curve  in  (2). 

b.  Inverse  weak-value  regime.  If  (j)  <  ka  <  1,  then  the  approximation  to  linear  order  in  ka  will  break  down,  as 
shown  earlier  in  subsection  IB 2c.  For  this  regime,  we  keep  linear  order  in  <f>  and  second  order  in  ka  in  (20)  to  find, 


iVi 

(S)i 

(S) 1 

iVi 


(1  -  'y)N(ka)2, 


-\/-( 1  -  7 )Nka<j>, 


SNRi 


=  \[^V( i-7)w  (J(M2  -  0 

-fi/*  (£*-*)* 


(24a) 

(24b) 

(24c) 

(24d) 


In  contrast  to  the  previous  approximation,  the  1/k  term  in  the  normalized  signal  ( S)i/Ni  leads  to  an  amplification 
in  measuring  an  unknown  (j)  given  a  known  small  k.  Indeed,  this  regime  was  used  in  Refs.  [12,  13]  for  exactly  this 
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Symbol 

Numerical  Value 

£ 

1.5  m 

a 

1  mm 

k 

1  x  1CT3  m'1 

ko 

8  x  106  m’1 

7 

0.01 

d 

1  cm 

TABLE  II.  Parameters  used  for  numerical  computations.  I  is  half  the  length  of  the  interferometer,  k  is  the  momentum  kick 
from  the  mirror,  a  is  the  input  beam  width,  ko  is  the  carrier  momentum,  7  is  the  loss  per  traversal,  and  d  is  the  half-width  of 
the  split-detector. 


purpose.  In  the  preprint  version  of  Ref.  [13]  it  was  noted  that  <j)  ss  21mW~1  is  the  inverse  of  the  weak  value  present 
in  the  weak-value  regime  for  small  </),  which  motivates  our  name  for  this  parameter  regime;  this  inverted  relationship 
has  also  been  rediscovered  more  recently  in  Ref.  [28].  The  normalized  signal  for  this  parameter  regime  is  shown  in 
the  right  plot  of  Fig.  5  as  a  function  of  <f>,  demonstrating  the  linear  response. 

Again,  we  can  reproduce  the  dominant  SNR  gain  factor  for  small  post-selection  probability  by  neglecting  the  walk- 
off  effects  and  the  power  attenuation,  which  can  be  done  by  expanding  nr(x)  to  second  order  in  ka  and  first  order  in 
4>  to  obtain, 


nr(x)  =  rc( 7, r)no(x)  {—kx(f>  +  k2x2) , 


(25) 


with  the  same  c(y,  r)  as  in  (23). 

As  with  the  weak-value  regime,  the  only  r-dependence  in  the  number  density  is  in  the  numeric  prefactor  rc(y,  r) , 
which  effectively  scales  the  total  photon  number  N  -7  rc( 7,  r)N.  Using  this  scaling,  the  result  (24)  for  the  split- 
detector  will  hold  for  any  r  to  second  order  in  k  and  first  order  in  <j> .  Hence,  the  SNR  will  scale  as  yjNrc{ 7,  r) 
when  walk-off  and  power  attenuation  effects  are  neglected.  When  7  -7  0,  this  also  recovers  the  dominant  y/r  SNR 
enhancement  factor  that  we  found  to  zeroth  order  in  the  post-selection  probability  of  (2)  in  subsection  IA2  from 
power  considerations. 

As  before,  the  walk-off  and  power  attenuation  effects  reduce  the  SNR  gain  below  this  optimistic  level.  Consider  the 
solid  blue  curve  in  the  right  plot  of  Fig.  7,  which  shows  the  exact  split-detected  SNR  gain  versus  traversal  number  for 
the  inverse  weak-value  regime.  As  anticipated  in  subsection  IB  2c,  the  forced  zero  in  the  center  of  the  double-lobed 
profile  naturally  stabilizes  the  beam  to  produce  a  saturated  SNR  for  more  traversals  than  the  weak-value  regime. 
However,  the  SNR  still  plateaus  relatively  quickly  before  eventually  decaying  to  zero  for  a  sufficiently  large  number 
of  traversals  r  without  additional  stabilization. 

Similarly,  the  degradation  from  walk-off  can  be  completely  reversed  by  employing  the  parity  flipping  technique.  The 
dot-dashed  red  curve  in  the  right  plot  of  Fig.  7  actually  exceeds  the  simple  power  scaling  law  illustrated  as  the  dashed 
black  curve  due  to  an  additional  accumulation  of  momentum  information  on  each  subsequent  traversal.  Moreover, 
the  normalized  signal  shown  in  the  right  plot  of  Fig.  6  shows  a  corresponding  increase  in  the  slope  of  SNR  vs.  <j)  with 
traversal  number. 


C.  Diverging  Pulse 

By  assuming  a  collimated  beam,  we  have  so  far  neglected  beam  propagation  effects  in  the  analysis,  as  well  as 
any  lens  effects  that  could  further  change  the  detection  physics.  In  order  to  incorporate  these  effects,  we  now  alter 
our  measurement  operators  and  pursue  a  numerical  approach.  We  find  that  these  effects  may  slightly  enhance  the 
SNR  gains  from  recycling  before  saturation  due  to  the  finite  detector  size,  but  do  not  fundamentally  alter  the  basic 
power-scaling  behavior.  For  that  reason,  it  will  be  sufficient  to  illustrate  only  the  weak-value  regime  as  an  example. 

Extending  the  collimated  analysis  in  subsection  I B  to  include  beam  propagation  leads  to  a  replacement  of  the 
measurement  operators  with  M±  — >  M'±  =  where  Ui  =  exp(— ip2f/2fc0),  fco  is  the  carrier  momentum  of 

the  pulse,  and  t  is  the  propagation  length  from  piezo  to  50:50  beam  splitter  [29].  The  number  density  (5)  will  then 
involve  the  composite  measurement  operator 

M_  (m;)'  1  =  TJtM_Ut  (ueM+Uey  1 


(26) 
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FIG.  8.  The  effect  of  propagation  and  different  diverging  lens  choices  on  the  SNR  of  the  weak-value  regime  with  ka  =  1CEC  and 
(j>  =  7r/16.  In  order  of  (blue,  solid),  (red,  dashed),  (purple,  dot-dashed),  and  (brown,  dotted)  we  show  weak  initial  diverging 
lenses  with  extreme  focal  lengths  s,  =  -1  m,  -  5  m,  -  10  m,  and  no  lens.  The  (black,  dashed)  curve  shows  the  scaling  given 
by  power  considerations  in  Eq.  (2).  Here  SNRi  refers  to  the  SNR  of  a  single  unrecycled  pulse  with  an  optimally  chosen  focal 
length  of  Si  =  —0.5  m,  while  SNRr  is  the  accumulated  SNR  over  r  traversals  for  the  indicated  lens  choices. 


Adding  a  diverging  lens  with  focal  length  Sj  also  modifies  the  initial  state  with  an  operator  Ul  =  exp(ikoX2 /2si). 
Hence,  powers  of  the  following  modified  operators  will  appear  in  the  full  solution, 

UeM _  =  ie-if,H/2k 0  sin  (</>/ 2  -  kx) , 

UfM+  =  e-,;P2f/fe°  cos  (</>/ 2  -  kx) , 

UeUL  =  e-ip2e/2k0eik0£2/2Si, 

which  can  be  simplified  recursively.  The  effect  of  a  diverging  lens  is  considered  for  comparison  with  the  unrecycled 
experiment  in  Ref.  [5] ,  where  such  a  lens  was  able  to  enhance  sensitivity. 

Table  II  shows  the  parameters  which  describe  the  laser  and  experimental  geometry.  Our  choice  of  k0  corresponds  to 
the  780-800  nm  lasers  used  in  [5,  9,  12,  13],  and  the  3-meter  interferometer  length  £  is  taken  from  the  generous  upper 
bound  estimate  discussed  in  subsection  I A  as  a  worst  case  scenario  for  beam  divergence  effects.  For  the  weak-value 
regime  of  small  ka  and  <j>  such  that  ka  <  <f>  <  1,  we  found  that  it  was  more  computationally  efficient  to  expand  the 
sine  and  cosine  functions  in  (27)  to  second  order  in  k  and  fourth  order  in  <f>.  To  test  the  validity  of  this  truncation, 
we  initially  set  the  interferometer  length  £  to  zero  so  that  a  comparison  could  be  made  with  the  previously  calculated 
collimated  solutions. 

We  restrict  our  attention  to  the  SNR  gains  achieved  by  recycling  a  single  pulse  for  r  traversals,  since  adding  more 
pulses  leads  to  a  simple  scaling  of  the  single  pulse  result.  The  SNR  gains  for  different  choices  of  initial  diverging  lens 
are  shown  in  Fig.  8,  where  they  are  compared  to  the  ideal  power-scaling  curve  that  we  expect  from  our  qualitative 
considerations  given  by  (2).  In  all  cases,  the  expected  gains  roughly  follow  the  qualitative  power  scaling  rule  for  a 
large  number  of  traversals  before  saturating  due  to  the  beam  growing  larger  than  the  finite  size  of  the  split-detector. 
Note  that  the  beam  divergence  mitigates  the  SNR  decay  that  was  observed  for  the  collimated  case,  even  without 
beam  stabilization  due  to  flipping  or  Zeno  reshaping. 


(27a) 

(27b) 

(27c) 


D.  Section  Conclusion 

By  investigating  the  optical  design  shown  in  Fig.  1,  we  have  shown  how  a  single  optical  pulse  can  be  trapped  inside 
the  interferometer  until  the  photons  all  exit  the  dark  port  and  are  “post-selected,”  greatly  boosting  the  sensitivity 
of  the  precision  measurement.  The  added  power  accumulated  at  the  detector  within  a  fixed  duration  of  time  is  the 
dominant  source  of  sensitivity  gain.  Further  increases  are  achievable  by  trapping  multiple  pulses  in  the  interferometer 
simultaneously.  The  number  of  trapped  pulses  is  limited  by  the  length  of  the  pulses,  the  gating  frequency  of  current 
Pockels  cells,  and  the  physical  size  of  the  interferometer. 

We  carefully  analyzed  the  case  of  a  collimated  beam  and  showed  that  repeated  post-selections  cause  a  walk-off 
effect  in  the  recycled  pulse,  which  tends  to  diminish  the  SNR.  However,  we  also  showed  that  these  walk-off  effects 
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can  be  easily  corrected  by  Zeno  reshaping,  or  by  a  parity  flip,  which  reflects  the  beam  around  its  optic  axis  on  each 
traversal.  Somewhat  surprisingly,  the  gains  with  parity  correction  can  even  exceed  those  expected  from  the  power 
scaling.  Including  propagation  effects  does  not  destroy  the  sensitivity  gain  shown  for  the  collimated  case,  but  instead 
can  produce  additional  enhancement. 

While  these  sensitivity  gains  alone  are  a  substantial  improvement  over  the  original  idea,  the  combination  of  these 
techniques  with  other  established  metrology  techniques — such  as  the  use  of  a  squeezed  reference  beam — could  further 
increase  the  sensitivity  beyond  that  indicated  here. 


II.  ANALYSIS  OF  ADVANTAGES  OF  WEAK  VALUE  AMPLIFICATION  IN  THE  PRESENCE  OF 

TECHNICAL  NOISE 

In  the  previous  section,  we  demonstrated  how  the  weak  value  amplification  techniques  can  be  further  improved 
by  recycling  most  of  the  light  emerging  from  the  bright  port  that  does  not  contain  information  about  the  desired 
parameter.  Another  very  important  issue  relates  to  how  the  original  (un-recycled)  measurement  scheme  improves  the 
accuracy  of  precision  measurement,  and  we  will  now  report  on  our  findings,  also  discussed  in  Ref.  [30].  While  we 
have  discovered  that  these  techniques,  in  and  of  themselves,  do  not  overcome  fundamental  limits  for  coherent  light 
sources  (“standard  quantum  limit”)  (see  e.g.  [9]),  there  are  technical  advantages  in  that  these  methods  make  the 
experimental  approach  to  these  limits  relatively  easy  with  common  experimental  equipment.  Many  concrete  examples 
were  mentioned  in  the  introduction  of  this  report  and  will  be  described  in  the  following  sections.  Recent  results  in 
quantum  metrology  [31]  indicate  that  under  very  general  conditions,  quantum  light  sources  do  poorly  in  the  presence 
of  additional  noise.  Therefore,  the  fact  we  demonstrate  that  the  weak  value  techniques  can  perform  much  better 
than  standard  coherent  state  methods  in  the  presence  of  technical  noise  is  a  very  exciting  prospect  for  precision 
measurements . 

Although  these  experimental  findings  have  been  employed  in  a  number  of  different  research  groups  and  applied 
to  metrological  questions  of  a  number  of  different  physical  parameters,  there  are  still  some  open  questions  and  even 
controversy  [32,  33]  about  this  technique:  precisely  how  and  to  what  extent  can  WVA  techniques  help  against  technical 
noise,  or  give  some  kind  of  technical  advantage  in  comparison  to  the  standard  measurement  techniques?  Starling  et 
al.  considered  a  particular  parameter  estimator,  showing  that  WVA  could  give  an  advantage  [9].  An  important 
step  was  made  in  this  question  when  Feizpour,  Xingxing,  and  Steinberg  [34]  were  able  to  consider  a  more  general 
kind  of  technical  noise,  and  showed  that  so  long  as  it  had  a  long  correlation  time,  WVA  also  help  suppress  it  in  the 
SNR.  In  other  closely  related  work,  Kedem  [35],  Brunner  and  Simon  [3]  and  Nishizawa  [36]  also  showed  an  increased 
performance  of  the  SNR  in  the  presence  of  technical  noise. 

In  contrast  to  these  results,  recent  papers  have  claimed  that  WVA  gives  no  technical  advantage  [32,  33].  The 
argument  is  justified  by  using  a  Fisher  information  analysis  of  technical  noise  applied  to  the  signal  carrier  (e.g., 
such  as  beam  displacement  jitter).  (We  note  the  Fisher  information  analysis  has  been  recently  applied  to  WVA  by 
other  authors  as  well  [15,  37].)  However,  this  particular  form  of  technical  noise  does  not  represent  the  complete 
picture.  There  are  many  forms  of  technical  noise  that  are  not  incorporated  in  this  model.  For  example,  in  optical 
beam  deflection,  noise  sources  include:  electronics  noise,  transverse  displacement  and  angular  jitter,  analog-to-digital 
discretization  noise,  turbulence,  vibration  noise  of  the  other  optical  elements,  spectral  jitter,  etc.  We  will  analyze 
some  of  these  models  and  examples  using  Fisher  information  and  maximum  likelihood  methods  in  order  to  understand 
in  precisely  what  sense  they  give  or  fail  to  give  a  technical  advantage,  as  well  as  describe  other  technical  advantages 
in  beam  deflection  (and  derivative)  experiments  where  the  imaginary  WVA  technique  does  lead  to  the  optimal  Fisher 
information  even  in  the  presence  of  some  types  of  noise  sources  mentioned  above. 

We  organize  this  section  as  follows.  In  Sec.  II  A,  we  introduce  the  concepts  of  Fisher  information  and  maximum 
likelihood  techniques,  and  illustrate  how  to  apply  them  to  Gaussian  random  measurements.  We  introduce  weak  value 
amplification  and  postselection  in  Sec.  II B.  Uncorrelated,  displacement  technical  noise  is  discussed  in  Sec.  II D.  Time 
correlated  technical  noise  is  analyzed  in  Sec.  HE.  Air  turbulence  is  discussed  briefly  in  Sec.  II F.  The  combination  of 
displacement  jitter  and  turbulence  is  discussed  in  Sec.  II G,  showing  the  weak  value  technique  can  suppress  both.  The 
weak  value  technique  is  shown  to  better  suppress  angular  jitter  in  deflection  measurements  in  Sec.  II H.  We  examine 
a  recent  photon  recycling  proposal  in  Sec.  Ill  and  show  the  Fisher  information  is  boosted  by  the  inverse  postselection 
probability.  Our  conclusions  are  summarized  in  Sec.  II J. 


A.  Fisher  information  of  an  unknown  parameter 

Fisher  information  describes  the  available  information  about  an  unknown  parameter  in  a  given  probability  dis¬ 
tribution.  Consider  an  unknown  parameter  d,  upon  which  some  random  variable  x  depends.  Let  the  probability 
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distribution  of  x,  given  d  be  p(x\d).  The  score  of  the  distribution  is  defined  as  S  =  ddlogp(x\d).  Assuming  p  is  a 
smooth  function,  the  average  of  S  over  p  is  0,  and  its  variance  (second  moment)  is  defined  as  the  Fisher  information, 


1(d) 


(S'2)  =  J  dxp(x\d)(dd  logp(x\d))2 
=  -  J  dxp(x\d)d2d  log p(x\d). 


(28) 

(29) 


Fisher  information  is  additive  over  independent  trials,  so  for  N  statistically  independent  measurements  (such  as  the 
collection  of  N  photons  from  a  coherent  source),  Xjv(d)  =  Nl(d). 

Consider  an  unbiased  estimator  of  d,  called  d.  This  is  any  statistical  estimator  whose  expectation  value  is  d. 
The  variance  of  d  is  bounded  from  below  by  the  Cramer-Rao  bound  (CRB),  or  Var[d]  >  l{d)~1 .  Thus,  the  Fisher 
information  sets  the  minimal  possible  estimate  on  the  uncertainty  of  d,  for  any  unbiased  estimator. 

To  illustrate  how  this  works,  let  us  consider  a  Gaussian  distribution  for  p({xi}\d),  which  will  describe  N  independent 
measurements  {xi}  of  an  unknown  mean  with  known  variance  a 2  for  each  measurement, 


PG({xi}\d) 


N 


n 


i 

\727rcr2 


exp 


(30) 


In  this  example,  the  score  is  given  by  S  =  Y^!i=i(xi  ~  d)/a2i  so  indeed  (S)  =  0,  and  the  Fisher  information  is  given  by 

lG(d)  =  Na~2.  (31) 

Thus,  the  CRB  on  the  variance  is  simply  a2 /N.  Consequently,  the  minimum  resolvable  signal  drnln  will  be  of  order  of 
dmin  ~  tx / y/N .  In  order  to  achieve  this  minimum  bound  on  the  variance,  the  optimal  unbiased  estimator  dopt  (often 
called  the  efficient  estimator)  must  be  used.  We  can  find  it  with  maximum  likelihood  methods,  by  setting  the  score 
to  zero,  and  replacing  d  by  dopt .  In  the  case  of  PG ,  we  have 


N 

S(d  y  dopt.)  —  ^  ^ (xj  dopt) / <J  —  0, 

l=i 


(32) 


so  we  find  the  efficient  estimator, 


N 

dopt  =  (l/N)Y/xj,  (33) 

l=i 

which  is  simply  the  average  of  the  data  in  this  case.  We  can  check  that  the  variance  gives  the  inverse  CRB:  (( dopt  — 
d)2)  =  (1  /N2)  J2fj(xixj)  =  cr2/N.  In  an  optical  context,  this  is  the  “standard  quantum  limit”  scaling  with  N,  which 
we  can  interpret  as  the  photon  number.  Here  the  parameter  d  can  be  interpreted  as  the  displacement  of  a  beam  with 
transverse  width  a.  One  can  immediately  see  that  the  maximum  Fisher  information  occurs  for  the  smallest  allowable 
beam  waist.  This  is  intuitively  obvious.  If  a  beam  of  very  small  waist  experiences  a  small  shift  in  its  mean  value,  a 
position  sensitive  detector  (e.g.,  a  split  detector)  would  see  a  large  change  in  intensity  as  a  function  of  its  position 
compared  to  a  beam  with  a  large  waist. 

In  the  rest  of  this  report,  we  will  consider  coherent  Gaussian  distributions  for  simplicity.  While  this  is  somewhat 
restrictive,  it  is  also  quite  reasonable  since  most  of  the  experiments  have  been  performed  using  coherent  Gaussian 
probability  densities.  This  is  also  quite  nice  theoretically,  owing  to  the  fact  that  the  log  likelihood  function  is  twice 
differentiable  and  we  can  use  variations  of  the  simple  form  of  the  Fisher  information  derived  above. 


B.  Real  weak  values  and  postselection 

To  apply  the  above  results  to  recent  optical  experiments,  we  briefly  recall  a  few  facts  about  weak  values  [1].  If  a 
quantum  system  is  prepared  in  an  initial  state  |i),  has  a  system  operator  A  that  is  measured  by  weakly  interacting 
with  a  meter  prepared  in  a  state  of  spatial  variance  a2,  and  then  postselected  in  a  final  state  | /)  with  probability 
7  =  |(/|?’)|2,  the  meter  degree  of  freedom  will  be  shifted  by  a  multiplicative  factor 


Aw  =  (f\A\i)/(f\i), 


(34) 
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where  Aw  is  the  weak  value  of  the  operator  A ,  while  leaving  the  width  a  unchanged  (for  the  moment  we  consider 
the  real  weak  value  case  for  simplicity).  Such  behavior  is  in  contrast  to  the  non-postselected  case,  where  if  the  initial 
state  is  an  eigenstate  of  A1  so  that  A\i)  =  a,i\i),  the  average  meter  shift  is  aid,  which  can  be  much  smaller  in  size  than 
the  weak  value  shift,  Awd. 

This  process  gives  rise  to  a  (normalized)  Gaussian  meter  probability  distribution  consisting  of  N'  =  yiV  measure¬ 
ment  events  |x'}, 


n' 


Pb(Wi}\d)  =  U 


i=l 


'J2'K02 


exp 


(Xj  Awd) 
2&2 


(35) 


Computing  the  CRB  on  the  variance  as  before  gives  the  score  to  be  S  =  Aw  Y^n=\{x'i  ~  Awd)/a2,  so  the  Fisher 
information  is 


W)  =  (. AwfN'a~ 2  =  (f\A\i)2Na~2.  (36) 

Note  that  the  post-selection  probability  7  canceled  out.  Therefore,  the  Fisher  information  is  the  same  as  before,  except 
for  a  factor  (f\A\i)2,  a  number  that  can  be  arranged  to  approach  1  for  a  two-level  system  with  a  judicious  choice  of 
operator,  pre-  and  post-selection  (see  Sec.  II C).  This  is  consistent  with  the  SNR  analysis  of  Ref.  [9].  Similar  points 
were  made  by  Hofmann  et  al.  [38].  We  can  then  extract  all  the  Fisher  information  from  the  weak  value,  showing 
that  ideally,  the  WV  technique  can  put  all  of  the  Fisher  information  into  the  post-selected  events,  which  matches  the 
Fisher  information  in  the  standard  methods  using  all  of  the  events.  We  note  that  it  is  not  surprising  that  considering 
any  sub-ensemble  gives  less  information  than  the  whole  ensemble.  What  is  surprising  is  that  by  using  this  particular 
small  sub-ensemble  gives  you  all  the  Fisher  information.  This  fact  alone  gives  us  technical  advantages,  as  we  shall 
see. 

Following  the  maximum  likelihood  method  presented  in  Eqs.  (32,33),  the  weak  value  efficient  estimator  is  given  by 
dwv  =  (1  /AWN')  xi- 


C.  Weak  value  for  a  two- level  system 

Our  calculations  are  based  on  an  interaction  of  the  type  U  =  exp(— i  dp  A),  where  d  is  the  unknown  small  parameter. 
Tracking/ measurements  of  the  meter  degree  of  freedom  x  after  the  postselection  gives  a  SNR  of  72.  =  (\/N  d/a)  x 
\{f\i)  Re(Au,)|.  If  instead,  measurements  of  the  conjugate  momentum  p  are  performed,  the  result  is  72  =  (y/N  d/a )  x 
\{f\i)  Im(Au,)|  [35].  We  will  show  that  under  the  right  choices  for  pre-  and  post-selection,  |(/|i)  Re(Au))|  =  1  or 

K/l*)Im(A«)i  =  I- 

We  chose  to  consider  the  operator  A  such  that  A|±)  =  ±|±),  where  |+)  and  |— )  form  an  orthonormal  basis  for  the 
Hilbert  space  of  a  two-level  system.  The  initial  state  of  the  system  can  be  written  as 

|i)  =  cos  (®)  ]+} sin  (®)  ]-},  (37) 

where  0  <  0  <  n  and  0  <  $  <  27t  represent  the  qubit  state  on  the  surface  of  the  Bloch  sphere.  The  postselection 
state  for  the  system  is  defined  as 

I/)  =  sin  +  0)  |+)  -  e^+2«  cos  +  ej  |->,  (38) 

where  9  and  <j>  are  angles  representing  the  deviation  (in  both  angular  directions  on  the  Bloch  sphere,  see  Fig.  9)  from 
the  state  orthogonal  to  |*). 


The  probability  of  the  postselection  on  |/)  and  the  weak  value  take  the  form, 


7  ~  l(/l*)|2=  cos2  </>  sin2  $  +  sin2(0  +  9)  sin2  4>, 


Re  (Aw) 


sin  9  sin(@  +  9) 

\UW2  ’ 

sin  0  sin(0  +  2  9)  sin(20) 


(39) 

(40) 


Im(Hw) 


2  ]</]*>  |2 


(41) 
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FIG.  9.  Representation  on  the  Bloch  sphere  of  the  preselection  and  postselection  states  for  0  =  7t/2.  The  state  |z)  corresponds 
to  a  unit  vector  anywhere  on  the  equatorial  plane,  and  the  state  |/)  is  off  to  be  antiparallel  to  |j)  in  the  two  angular  directions. 
The  angle  29  ( 2cj> )  gives  origin  to  the  real  (imaginary)  part  of  the  weak  value. 


FIG.  10.  | (f\i)  Re(Au,)|  (a)  and  \{f\i)  Im(Am)|  (b)  as  a  function  of  9  and  (f> ,  for  0 
are  upper  bounded  by  unity. 


=  7t/2  (see  text  for  details).  Both  functions 


Note  that  the  angles  9  and  </>  are  the  respective  generators  of  the  real  and  the  imaginary  part  of  the  weak  value.  For 
example,  a  pure  imaginary  weak  value  can  be  obtained  making  9  =  0,  so  7  =  sin2  0  sin2  (f>  and  Aw  =  —i  cot  (f>.  In  order 
to  maximize  7  we  choose  0  =  7t/2,  so  the  results  are  |z)  =  [|+)  +  el$|— )]  /y/2  and  |/)  =  \_e~l^\+)  —  )]  y/2. 

Finally,  in  the  small  angle  approximation,  7  ss  <fi2  and  Aw  ss  —i/41,  making  \{f\i)  Im(AUJ)|  ss  1.  On  the  other  side,  a 
pure  real  weak  value  can  be  obtained  making  4>  =  0,  so  7  =  sin2  9 ,  and  Aw  =  sin(0  +  9)/ sin  9.  In  order  to  make  Aw 
large  for  this  case,  we  choose  again  0  =  ir/2.  The  preselection  is  identical  to  the  former  case  |z)  =  [|+)  +  el4,|— )]  / a/2, 
and  the  postselection  takes  the  form  |/)  =  sin#  [|+)  +  ei‘I>|— )]  /\/2  +  cos  9  [|+)  —  el4>|  — )]  / a/2-  Taking  the  small  angle 
approximation  for  this  case  we  obtain  a  similar  result  to  the  pure  imaginary  weak  value  case,  7  ss  02,  Aw  ss  1/0,  and 
K/|i)Re(Af0)|«l. 


Setting  0  =  7r/2  turns  out  to  be  the  best  choice  for  practical  purposes.  We  therefore  calculate  the  products 
|(/|z)  Re(Au,)|  and  |(/|z)  Im(Aw)|  for  such  a  choice,  and  plot  them  in  Fig.  10.  It  is  shown  in  Fig.  10(a)  that  the 
largest  value  \(f\i)  Re)^)!  =  1  is  closest  reached  if  </>  =  0  and  \9\  <C  1,  which  corresponds  to  a  pure  real  weak  value 
and  an  almost  orthogonal  postselection.  Similarly,  Fig.  10(b)  shows  that  \(f\i)  Im^^)!  «  1  only  if  9  =  0  and  \4>\  1, 

defining  a  pure  imaginary  weak  value  with  almost  orthogonal  postselection. 
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D.  Type  1  technical  noise:  Displacement  noise 


As  a  next  step,  we  include  the  effect  of  one  kind  of  technical  noise.  We  first  consider  the  technical  noise  model  of 
Knee  and  Gauger,  where  the  N  random  variables  {xj}  are  independently  distributed  with  mean  d  and  variance  a2 
[32].  We  note  that  similar  models  have  also  been  previously  discussed  in  Refs.  [9,  34,  35].  Gaussian  white  technical 
noise  {£,}  is  added  to  {a:,}  with  mean  0  and  covariance  =  J2dij-  Knee  and  Gauger  comment  that  this  kind 

of  noise  might  represent  transverse  beam  displacement  jitter  (in  a  collimated  beam),  for  example.  We  call  this 
type  1  technical  noise.  The  measured  signal  will  be  the  sum  of  Si  =  x,;  +  so  the  distribution  function  for  s-i  is 
p({si}\d)  =  /  X>xX>£p({xj})p({£j})<5(sj  —  x*  —  £*),  where  /  Vx  =  n^li  I  dxi  integrates  over  all  variables.  Integrating 
over  Xj  gives  the  distribution  of  the  measured  results  as  a  convolution  of  the  two  distributions, 


p({s,;}|d)  =  II/ d&Afexp  — oa2  —  )  e  f,2/2J 

i—1 J  ' 

A  1  (  ( d~Sif\ 

nvWT^)eXPl  2  (u2  +  J2)j’ 


(42) 


where  M  is  a  normalization  constant.  Calculating  the  Fisher  information  as  before,  for  N  independent  measurements 
(photons)  gives 


?icg 


N 

a2  +  J2' 


(43) 


The  technical  noise  simply  broadens  the  width,  decreasing  the  Fisher  information.  In  the  weak  value  case,  we  follow 
the  same  procedure,  except  that  d  — ¥  Awd,  and  N  — >  N’  =  7 N,  where  7  is  the  post-selection  probability.  We  add  the 
same  technical  noise  to  the  resulting  distribution  of  post-selected  events  {s'},  and  get 


N' 


P\{s'i}\d)  =  11  /  d^Afex p  (  - 
1 


i=\" 

N 

n^TT  (a2  +  J2) 


0 dAw  s'  $i)2N)  (0-{2/2 j2 

2a2 


exp 


(dAw  -  s')2 
2(cr2  +  J2) 


(44) 


Calculating  the  Fisher  information  for  the  postselected  case  (which  must  be  reduced  by  7,  the  post-selection  proba¬ 
bility)  gives 


V  =  lAlN  (f\A\i)2N 
KG  a2  +  J2  a2  +  J2 


(45) 


where  we  used  the  weak  value  formula  (34).  Consequently,  as  before  we  find  the  Fisher  information  is  modified  by  a 
factor  of  order  1  (which  will  decrease  or  keep  the  Fisher  information  the  same,  as  discussed  before).  This  is  the  same 
conclusion  reached  by  Knee  and  Gauger  [32]  and  Ferrie  and  Combes  [33],  that  the  weak  value  amplification  offers 
no  increase  of  Fisher  information  for  technical  noise.  This  result  was  previously  found  by  Feizpour,  Xingxing,  and 
Steinberg  [34],  who  noted  that  this  type  1  technical  noise  could  not  be  suppressed  by  the  real  weak  values  technique. 
However,  as  we  will  show  in  Sec.  II G,  this  is  not  the  case  for  techniques  implementing  an  imaginary  weak  value. 

We  go  on  to  find  the  efficient  estimator  in  the  post-selected  case,  following  the  maximum  likelihood  method  presented 
in  Eq.  (32,33).  The  efficient  estimator  is  given  by  (1kg  =  ( l/AwN ')  +£»)•  In  other  words,  one  can  efficiently 

estimate  the  parameter  from  the  SNR.  We  note  that  in  either  the  standard  or  WVA  scheme,  the  Fisher  information 
can  be  further  improved  by  reducing  the  width  of  the  meter  state,  but  only  until  er  ~  J,  after  which  the  technical 
noise  dominates  the  variance.  We  will  return  to  this  point  in  Sec.  II G. 


E.  Type  2  technical  noise:  Correlated  noise  model  of  Feizpour,  Xingxing,  and  Steinberg 

In  order  to  achieve  the  CRB,  the  Fisher  information  must  not  only  be  calculated,  but  the  associated  estimator 
must  also  be  practical  to  implement.  The  point  of  practicality  of  the  efficient  estimator  can  be  strongly  made  by 
considering  the  analysis  of  Feizpour,  Xingxing,  and  Steinberg  [34].  Consider  an  experiment  with  single  photons,  such 
that  the  measurement  is  only  triggered  by  the  photon  detection  at  the  detector,  and  consists  of  N  measured  variables 
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4>j  described  by  an  average  </>  plus  a  noise  term  rjj  that  is  correlated  in  general,  (■ i)iTjj )  =  Cij  (we  define  this  as  type 
2  technical  noise).  The  noise  term  contains  both  quantum  and  technical  noise.  It  is  then  straightforward  to  check 
that  the  average  is  given  by  (1/iV)  JT=1  </>»  =  4>,  and  the  variance  is  given  by  V  =  (1  /IV2)  'Yjij{4>i4>j) ■  The  two  limits 
considered  in  Ref.  [34]  are  (i)  the  white  noise  limit,  C^  =  Cd,j,  and  (ii)  the  fully  correlated  limit  Ctl  =  C.  In  case 
(i)  we  have  V  =  C/N ,  while  in  case  (ii),  we  have  V  =  C .  Thus  averaging  helps  the  SNR  for  white  noise,  but  not  for 
fully  correlated  noise. 

Now  consider  the  postselected  case,  where  </>  —>  <j>Aw,  and  N  — >  N'  =  7 N.  In  case  (i),  the  variance  is  now  C/N', 
while  the  signal  is  Aw<f>,  so  the  SNR  scales  like  Aw^j,  so  the  small  post-selection  probability  drops  out,  indicating 
there  is  no  advantage  to  using  post-selection  in  this  case  (exactly  as  we  calculated  in  the  previous  subsections  for  type 
1  technical  noise).  However,  in  case  (ii),  the  signal  is  boosted  the  same  amount,  while  the  variance  remains  C  in  the 
fully  correlated  case.  Consequently,  the  SNR  shows  an  advantage  over  the  non-postselected  case.  Feizpour,  Xingxing, 
and  Steinberg  go  on  to  consider  a  particular  noise  model  consisting  of  a  combination  of  white  and  time-correlated 
noise,  showing  this  advantage  remains  so  long  as  the  correlation  time  remains  long  compared  to  the  photon  production 
rate  (as  it  is  in  many  optical  implementations). 

We  can  now  revisit  this  model  in  the  context  of  the  Fisher  information  metric  to  see  how  it  compares  to  the  SNR 
metric.  The  joint  probability  distribution  of  all  of  the  variables  < pi  can  be  written,  assuming  Gaussian  statistics,  as 
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1 

— /  - exp 

V/(27r)ArdetC 


(x-^)T  ■  C  1  ■  (x-^)~ 
2 


(46) 


Here,  x  is  a  vector  of  elements  <fii,  and  p  is  a  vector  of  the  means  (in  this  case,  p  =  <j)l  is  a  vector  with  the  same  mean 
in  every  element).  The  matrix  C  is  the  covariance  matrix  and  has  elements  Cij.  C~x  is  the  inverse  of  the  covariance 
matrix,  and  detC  is  its  determinant. 

In  this  case,  we  may  calculate  the  Fisher  information  (29)  about  <f>  contained  in  this  distribution,  and  obtain, 
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(47) 


Notice  this  Fisher  information  contains  a  double  sum  of  the  elements  of  the  inverse  covariance  matrix.  The  “diagonal” 
terms  describe  the  self-correlation  terms,  and  scale  proportionally  to  N ,  recovering  the  independent  trials  for  the  case 
when  the  covariance  matrix  is  diagonal.  However,  for  strong  correlations,  such  as  the  type  Feizpour,  Xingxing,  and 
Steinberg  consider,  each  element  of  the  inverse  covariance  matrix  can  be  of  comparable  value,  so  the  Fisher  information 
can  scale  at  most  like  N  ,  giving  a  much  larger  Fisher  information  than  for  uncorrelated  noise.  The  reason  is  because 
of  the  correlations  between  the  different  measurements  that  the  SNR  metric  misses. 

When  we  go  to  the  postselected  case,  the  dimension  of  the  matrix  shrinks  from  N  to  7IV,  while  the  mean  is  boosted 
by  Aw  ~  7-1/2.  The  Fisher  information  is  boosted  by  I/7  as  before,  but  the  double  sum  in  (47)  now  only  goes  to 
jN  in  the  upper  limit,  X's  =  A2W  C'iJ1  ■  The  covariance  matrix  is  different  in  general,  since  it  now  describes  the 
correlations  between  the  postselected  photons  only.  Consequently,  for  white  noise  the  Fisher  information  is  the  same, 
up  to  a  factor  of  order  1,  while  for  highly  correlated  noise,  the  Fisher  information  scales  at  most  as  N2r),  so  it  is 
actually  decreased  by  a  factor  of  7  by  the  weak  value  technique,  compared  to  the  non-postselected  case.  It  is  easy  to 
see  why:  the  correlations  of  any  single  postselected  photon  with  any  rejected  photon  are  lost  in  the  detection  scheme 
(unless  further  processing  of  the  correlated  missing  photons  is  done) ,  and  consequently  cannot  be  harnessed  to  further 
suppress  the  noise.  In  contrast  to  this,  if  the  photon  is  correlated  only  with  itself,  then  taking  a  random  postselection 
will  not  hurt,  and  the  Fisher  information  can  stay  the  same. 

However,  this  is  not  the  whole  story.  We  must  ask  what  estimator  should  be  used  that  saturates  the  CRB,  and 
whether  it  is  practical  to  implement  it.  We  can  find  this  estimator  using  maximum  likelihood  methods  described 
earlier  (32,33)  to  find  the  estimator  <j> .  We  assume  that  the  covariance  matrix  is  positive  definite,  symmetric,  and 
invertible.  It  then  follows  that  its  inverse  is  also  symmetric.  We  find  the  estimator  in  the  non  post-selected  case  to  be 
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We  can  check  the  variance  of  this  estimator  matches  the  CRB, 
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because  the  correlation  of  the  two  random  variables  gives  the  elements  of  the  covariance  matrix,  Cji,  which  cancels 
one  of  the  inverse  matrices  in  the  sums,  giving  one  factor  of  the  denominator,  resulting  in  the  CRB  we  found  above, 
the  inverse  of  (47). 

However  there  is  a  difficulty  in  this  result:  The  experiment  implementing  the  estimator  (48)  must  multiply  every 
data  point  4>j  by  a  different  weighting  factor,  fj  =  J2iLi  j'=i  which  knows  about  the  rest  of  the  data 

points.  The  experimenter  must  know  exactly  what  the  correlations  are,  how  many  data  points  are  being  collected, 
and  must  be  able  to  weight  each  data  point  by  a  different  factor  in  order  to  extract  the  maximal  information  in  the 
data  average.  This  is  generally  a  very  challenging  experimental  task.  Therefore,  the  SNR,  which  treats  every  type 
of  noise  on  equal  footing  (so  the  weighting  assignment  is  fj  =  1/N  for  all  j)  is  usually  the  most  practical  option. 
Consequently,  the  SNR,  although  suboptimal  as  a  means  for  estimating  in  this  case,  is  still  superior  since  the  optimal 
estimator  is  impractical  to  implement  for  typical  experiments.  This  is  why  the  SNR  is  used  by  experimentalists: 
because  a  complete  categorization  of  the  noise  correlations  is  a  formidable  task  requiring  detailed  knowledge  of  noise 
correlations  and  extensive  post-processing.  If  we  go  on  to  consider  non-Gaussian  correlated  noise,  such  as  1/f  noise, 
the  problem  of  implementing  the  estimator  becomes  even  worse. 


F.  Type  3  technical  noise  -  air  turbulence 


Another  type  of  technical  noise  that  is  very  important  in  open  air  experiments  is  turbulence,  which  we  refer  to  here 
as  type  3  technical  noise.  While  we  will  not  give  a  quantitative  analysis  of  this  effect  here,  it  gives  additional  beam 
width  broadening  beyond  the  diffraction  limit  because  of  beam  breathing  (on  a  short  time  scale),  and  beam  wander 
(on  a  longer  time  scale)  because  of  the  propagation  through  the  random  medium  [39].  This  becomes  an  important 
problem  when  there  is  a  large  optical  path  length  from  the  position  where  the  deflection  occurs  to  the  detector  where 
it  is  measured.  Beam  jitter  from  turbulence  cannot  be  underestimated  when  dealing  with  extremely  small  deflections. 
The  beam  wander  effects  become  important  on  time  scales  longer  than  the  ratio  of  the  beam  width  to  the  typical  air 
velocity.  Typical  experiments  can  run  between  seconds  and  hours,  so  this  effect  must  be  accounted  for.  The  weak 
values  schemes  have  a  distinct  advantage  over  the  standard  beam  deflection  measurement  in  having  short  optical  path 
lengths.  We  will  see  this  in  detail  in  the  next  subsection. 


G.  Imaginary  weak  values  and  technical  advantages 

We  proceed  to  consider  situations  where  imaginary  WVA  has  technical  advantages  for  combined  technical  noise 
types.  Kedern  points  out  that  in  the  case  of  imaginary  weak  values,  noise  in  the  average  position  does  not  appear  in 
making  measurements  in  the  momentum  basis,  and  that  noise  on  the  average  momentum  helps  the  SNR  [35].  We  focus 
on  a  different  effect  unique  to  imaginary  weak  values.  For  definiteness,  consider  beam  deflection  measurements  for  a 
coherent  Gaussian  beam  using  standard  deflection  techniques  versus  a  Sagnac  interferometer  weak  value  experiment. 
Most  experiments  used  to  date  that  show  technical  advantages  use  imaginary  weak  values. 

To  start,  assume  that  the  system  has  no  technical  noise.  The  unknown  parameter  of  interest  is  the  deflection  k  of 
the  beam,  which  can  be  interpreted  as  the  transverse  momentum  kick  given  by  a  mirror.  Treps  et  al.  showed  that  one 
can  interfere  a  local  oscillator  of  a  first  order  TEM  mode  to  achieve  the  optimal  Fisher  information  [22],  However,  for 
simplicity  and  to  understand  the  role  of  beam  diameters  we  consider  a  more  standard  approach.  A  tilt  or  deflection 
on  a  beam  requires  propagation  to  observe  a  displacement.  The  standard  method  of  measuring  an  unknown  mirror 
tilt  k  with  a  beam  of  width  a  is  to  propagate  the  light  and  focus  it  with  a  lens,  thereby  taking  the  Fourier  transform 
of  the  beam  by  measuring  the  beam  with  a  split  detector  in  the  back  focal  plane  of  the  lens  (see  Fig.  11  (a)  assuming 
q  =  0).  The  lens  transforms  the  tilt  k  into  a  displacement  fk/ko  on  the  detector,  where  /  is  the  focal  length  and  ko  is 
the  wavenumber.  The  lens  takes  the  beam  width  a  at  the  mirror  to  a  beam  width  at  the  focus  given  by  <jf  =  f  /2ko<r. 

The  single  photon  probability  distribution  function  is  then 


pn{x\k)  =  Af  exp 


(■ x  -  fk/ko)2 

2°/ 


(50) 


where  Af  is  a  normalization.  If  we  send  N  independent  photons  to  measure  k,  this  yields  a  Fisher  information  of 
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FIG.  11.  Two  strategies  to  detect  an  unknown  constant  momentum  kick  k  given  angular  jitter  of  momentum  q.  (a)  Standard 
method:  After  the  momentum  kick,  the  beam  propagates  a  distance  l\  before  getting  a  momentum  kick  k  from  the  mirror,  that 
is  the  parameter  we  wish  to  estimate.  The  beam  then  passes  through  a  lens  with  a  focus  f  =  h,  chosen  to  equal  the  distance 
traveled  until  it  comes  to  the  split  detector,  where  the  beam  reaches  its  focus  as  a  position  sensitive  detector  (PSD),  (b)  Weak 
value  type  method:  After  the  momentum  jitter  kick  q,  the  light  enters  a  Sagnac  interferometer,  comprised  of  a  50/50  beam 
splitter  (BS),  Soleil-Babinet  compensator  (SBC),  which  introduce  a  relative  phase  shift  <j>  between  the  paths,  other  mirrors, 
including  the  one  that  has  the  the  momentum  kick  k  we  wish  to  measure.  The  beam  ends  at  a  position  sensitive  detector  (PSD) 


which  is  an  intuitive  result.  This  means  that  in  order  to  get  the  smallest  beam  waist  in  the  back  focal  plane,  one 
wants  the  largest  beam  waist  possible  before  the  lens.  We  now  compare  this  with  the  Sagnac  interferometer  weak 
values  result. 

The  weak  value  technique  for  measuring  the  beam  deflection  (Fig.  11(b)  with  q  =  0)  gives  the  post-selected 
distribution,  that  when  properly  normalized,  is  a  Gaussian  distribution  with  mean  Aka2 /(j)  and  width  a2.  Here, 
(f>  is  the  phase  difference  of  clockwise  and  counter-clockwise  photons  in  the  interferometer  applied  by  the  Soleil- 
Babinet  compensator  (SBC).  This  applies  only  to  the  post-selected  fraction  N'  =  "/N  of  the  photons  that  satisfy  the 
postselection  criterion,  where  7  =  (j)2 /A.  The  postselected  single  photon  probability  distribution  function  is  then 


p’H(x\k)  =  M'  exp 


( x  —  Aka2 /(j))2 

2^ 


(52) 


The  Fisher  information  for  the  postselected  measurements,  reduced  by  the  postselection  probability  7  ( N  — >  ‘jN )  is 
then  found  to  be 

Th  =  (7iV)4cr4/(7CT2)  =  Aa2  N.  (53) 

Thus,  the  weak  values  and  standard  method  result  in  the  same  amount  of  classical  Fisher  information,  as  shown  in 
Sec.  IIC,  see  also  [40].  It  is  important  to  realize  that  this  calculation  is  only  valid  in  the  regime  of  small  angles 
sin2((/>/2)  ss  </>2/4.  In  other  words,  in  the  small  angle  or  weak  value  regime,  all  of  the  Fisher  information  is  in  the 
photons  leaving  the  dark  port  of  the  interferometer. 

So  how  does  noise  affect  the  Fisher  information  for  the  two  methods?  As  a  first  step,  we  will  assume  that  there  is 
no  transverse  beam  deflection  jitter,  but  only  transverse  detector  jitter.  In  other  words,  there  is  a  small  transverse 
deflection  k  with  no  technical  noise  on  the  beam,  but  the  detector  used  to  measure  the  beam  has  a  transverse  jitter 
£  that  we  only  sample  at  the  photon  arrival  times,  giving  type  1  technical  noise.  We  first  consider  the  standard 
method.  As  before,  the  new  likelihood  function  is  the  convolution  of  the  technical  noise-free  likelihood  function  with 

a  Gaussian  of  width  J.  This  simply  increases  the  average  beam  waist  at  the  detector  by  'a2  +  J2,  resulting  in  a 
Fisher  information 


=  N(f/k0)2  =  ANa 2 
a}  +  J2  1  4.  ( 


(54) 


This  is  a  rather  interesting  result.  It  means  that  this  approach  can  achieve  the  maximum  Fisher  information  (as  if 
there  were  no  noise  at  all),  but  only  in  the  limit  of  large  focal  length  (/  2kqaJ).  A  large  focal  length  is  equivalent 
to  opting  for  a  larger  displacement  ( fk/kq )  while  allowing  for  a  large  focal  spot  (07  J).  This  is  once  again  an 

intuitive  result:  If  a  very  small  focal  spot  lands  on  a  detector  that  has  Gaussian  random  shifts,  large  differential 
intensity  fluctuations  will  occur,  due  to  detector  jitter,  compared  with  a  large  focal  spot. 

For  the  imaginary  weak  value  approach,  the  results  are  quite  different.  The  beam  waist  for  this  case  is  given  by  a, 
not  the  focused  beam  waist  a f.  This  gives  a  Fisher  information  of 

 4iV<T4 
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Therefore,  since  cr  a f  and  there  is  freedom  to  choose  a  as  large  as  one  wishes,  we  can  make  the  beam  waist  much 
larger  than  J,  which  both  suppresses  J  and  increases  the  Fisher  information  (53).  This  shows  that  Fisher  information 
for  the  imaginary  weak  values  approach  remains  unchanged  in  the  presence  of  transverse  detector  jitter,  while  the 
focused  beam  approach  requires  long  focal  lengths. 

Hence,  the  measurement  geometry  plays  a  very  important  role  in  several  ways.  Ideally  one  would  like  a  detector 
with  a  continuous  detection  distribution,  but  practical  considerations  mean  there  will  always  be  dead  space  between 
finite  width  detectors.  A  simple  method  for  obtaining  nearly  optimal  beam  detection  is  split  detection.  However, 
even  in  this  case  there  is  always  a  gap  between  the  detectors,  which  sets  a  minimum  beam  diameter  and  thus  a  finite 
propagation  through  horizontal  turbulence.  This  is  not  a  problem  for  weak  values  since  the  detector  can  be  placed 
immediately  after  the  last  beam  splitter  which  mitigates  the  type  3  technical  noise  while  also  suppressing  type  1 
technical  noise.  The  standard  method  operating  in  the  regime  where  type  1  noise  is  completely  ameliorated  will  suffer 
from  the  type  3  technical  noise. 


H.  Type  4  technical  noise  -  angular  beam  jitter 

We  now  consider  angular  beam  jitter  by  modeling  it  as  a  random  momentum  kick  q  given  to  the  beam  before  it 
enters  the  interferometer,  or  before  it  approaches  the  signal  mirror  in  the  standard  method.  This  kick  could  be  from 
air  turbulence  or  mirror  jitter. 


1.  Standard  method  results  for  angular  jitter 


We  first  consider  the  standard  method  of  measuring  the  beam  displacement  illustrated  in  Fig.  11(a).  The  Fourier 
optics  of  this  geometry  is  described  by  a  series  of  unitary  operators  that  act  on  the  transverse  degree  of  freedom  in 
the  paraxial  approximation.  Starting  from  an  initial  transverse  state  \ip),  that  we  take  to  be  a  Gaussian  in  transverse 
position  with  zero  mean,  and  variance  a2,  the  unitary  Uq  =  exp (iqx)  gives  the  first  random  momentum  kick  q  to  the 
beam.  This  is  followed  by  a  propagation  of  distance  l\,  given  by  the  unitary  C/q  =  exp(— ip2li/2ko),  where  kg  is  the 
wavenumber  of  the  light.  This  is  followed  by  a  momentum  kick  k  described  by  Uq- and  then  the  lens  which  gives  a 
quadratic  phase  front,  Uf  =  exp(— ik^x2 /2f).  The  final  propagation  Ui2  puts  the  beam  at  the  measurement  device, 
which  is  a  position-sensitive  detector  (PSD). 

Taken  together,  we  can  describe  the  final  state  as 


ipd(x)  =  (x\Ui2UfUkUhUq\ijj). 


(56) 


To  obtain  an  explicit  form  for  the  state,  and  the  expectation  of  the  position  and  its  variance  in  this  state,  a  series 
of  complete  sets  of  states  are  inserted  between  the  unitaries.  As  an  intermediate  step,  let  us  define  state  if>i  as  the 
state  after  the  k  momentum  kick,  but  before  the  lens.  The  action  of  the  lens  is  diagonal  in  the  position  basis,  but  the 
subsequent  propagation  is  diagonal  in  the  momentum  basis.  Defining  the  coordinate  at  the  detector  as  x,  the  state 
at  the  detector  ipd{x)  is  given  by 


i>d(x) 


^  J  dy^i{y)e-ikoy2'2f-ipye-ip2l2/2ko+ipx. 


We  can  reverse  the  order  of  integration  and  perform  the  p  integration  first.  This  gives 
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Let  us  choose  I2  =  /,  so  the  quadratic  terms  in  the  exponential  cancel  out,  which  will  lead  to  focusing  the  beam.  This 
leaves  the  state  as 

Mx)  =  i^keik°x2/2f  j  dvMy)e-iykoX/f,  (59) 

which  simply  gives  the  scaled  Fourier  transform  of  tfi,  but  with  a  particular  value  of  the  momentum,  ipi(j>  — >  k$x/ /). 

The  remainder  of  the  solution  involves  finding  the  intermediate  state.  This  is  straightforward  in  the  momentum 
basis,  because  the  first  three  operators  are  diagonal  in  this  basis.  Therefore,  the  momentum-space  expression  for  ipi 
is  given  by 


i>i(p)  =  (8na2)1/4  exp(-a2 (p  +  q  +  k )2  -  il\{p  +  k)2/2k0). 


(60) 
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Putting  these  results  together,  we  find  that  the  final  state  at  the  detector  gives  a  Gaussian  distribution  in  position  x 
with  average  and  variance  of 

(x)  =  -{k  +  q)f/k0,  ((x  -  (x))2)  =  f2/{2ak0)2.  (61) 


The  net  result  is  that  the  random  momentum  kick  q  simply  adds  to  the  signal  k.  Averaging  this  distribution  of 
Gaussian  random  jitter  of  momentum  q  with  zero  mean  and  variance  Q 2  gives  another  Gaussian  distribution  for 
x,  of  mean  —kf/kg  and  wider  variance  /2[l/(2crfc0)2  +  Q2/k g].  Thus,  the  Fisher  information,  for  N  independent 
measurements,  about  k  in  this  distribution  is  given  by 


T  _  AN a2 
d  1  +  (2aQ)2  ' 


(62) 


2.  Weak  value  treatment  of  angular  jitter 


A  similar  analysis  can  be  carried  out  for  the  weak  value  interferometer  case.  In  addition  to  the  momentum  kick 
and  the  propagation  steps,  there  are  two  unitaries  that  depend  on  the  which- way  operator  W  =  |  0)(0  |  —  |  0)(0  |, 
where  the  states  |  O),  |  O)  are  clockwise  and  counterclockwise  moving  photon  states  inside  the  Sagnac  interferometer. 
Superpositions  of  these  states  are  created  by  the  50-50  beam  splitter  operating  on  the  incoming  beam.  The  unitaries 
are  the  relative  phase  shift  operator,  £7^,  =  exp(itj)W/2),  induced  by  the  SBC,  and  the  which-path  momentum  kick 
operator,  Uwk  =  exp(ikWx)  delivered  by  the  signal  mirror. 

As  shown  in  Fig.  11(b),  starting  in  the  state  |'Pi)  =  |^)(|  O)  +i\  O))/1/^,  and  ending  in  state  |'P/)  =  |a:)(|  O)  —  i\  O 
))/V2,  we  seek  the  transition  amplitude 

(^f\Ui2UkWU^UhUq\^t),  (63) 

which  describes  photons  entering  in  one  interferometer  port  and  exiting  the  other  interferometer  port.  Fortunately, 
the  which-path  states  pass  through  all  but  two  of  the  operators,  so  this  amplitude  may  be  simplified  to  eliminate  the 
which-path  states  and  operators,  giving  the  state  of  the  transverse  beam  at  the  detector,  ipwv(x ),  to  be 

ipwv(x)  =  {x\e~^2k°  sin (kx  +  (f/2)e-ip2h/2koeiqi\ ip).  (64) 


The  detailed  calculation  of  this  state  involves  inserting  complete  sets  of  states,  resulting  in  a  complicated  expression. 
The  result  is  simplified  by  expanding  to  linear  order  in  (j)  and  k  since  we  assume  the  weak  value  ordering  of  parameters, 
ka  <  <f>  <  1.  We  must  renormalize  the  post-selected  distribution  by  the  probability  7  of  a  photon  exiting  the  dark 
port,  given  by 

7  =  {4>/2)2  +  k2<r2  +  0(k2q2)  +  0(4>kq)  + -  (65) 

We  drop  the  other  terms  compared  with  (</>/ 2)2,  so  the  phase  shift  4>  controls  the  post-selection.  This  gives  an  average 
displacement  of 


_  4fca2  kh(h  +  l2)  q(h  +  l2) 
4>  k^cfa2  k0 


(66) 


where  we  suppress  higher  order  terms  in  k,  q1  </>.  The  first  term  of  order  k  is  the  usual  weak  value  term,  amplified  by 
1  /(j)  [5] .  The  second  term  comes  from  the  diffraction  effects  which  gives  a  small  correction  to  the  first  term  because  we 
take  cr2  +  h)/2ko.  The  remaining  term,  q(li  +l'2)/ko  is  just  the  free  propagation  from  the  momentum  kick, 

and  has  a  geometric  optics  interpretation  of  the  imparted  deflection  angle  q/ko  times  the  total  length  of  propagation. 
We  note  that  the  way  q  enters  into  the  average  displacement  has  a  very  different  form  than  Eq.  (61).  We  will  return 
to  this  shortly.  The  displacement  variance  at  the  detector  may  be  similarly  calculated  to  find, 
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plus  further  corrections  of  order  k2 .  Thus,  we  see  that  the  diffraction  effects  broaden  the  width  of  the  beam,  pro¬ 
portional  to  the  total  path  length.  If  we  approximate  the  distribution  as  a  Gaussian  with  the  mean  and  variance 
discussed  above,  the  Fisher  information  about  k  can  be  found  by  averaging  over  q  as  before  to  find 
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Consequently,  even  if  aQ  is  large  compared  to  1,  so  as  to  degrade  the  Fisher  information  in  the  standard  method  (62),  in 
the  weak  values  technique,  there  is  an  additional  suppression  factor  of  the  amount  of  diffraction,  (Zi  +  Z2)/(2fcoo-2)  <C  1, 
indicating  that  the  weak  value  technique  outperforms  the  standard  method  when  dealing  with  angular  jitter.  This 
result  can  be  understood  intuitively  because  the  angular  jitter  directly  adds  to  the  detected  deflection  in  the  standard 
measurement  case,  whereas  it  is  only  a  small  correction  to  the  deflection  that  is  controlled  by  diffraction  in  the  weak 
value  technique. 

The  rather  remarkable  properties  associated  with  imaginary  weak  value  experiments  may  not  be  entirely  attributed 
to  weak  values,  but  to  geometric  terms  associated  with  the  experiments.  These  terms,  such  as  beam  waist  diameter, 
arise  even  when  WVA  is  small.  However,  it  is  important  to  note  only  in  the  limit  of  large  WVA  that  all  of  the 
information  in  the  measurement  can  be  placed  in  the  measured  photons  and  that  optimal  SNR  estimation  of  the 
parameter  can  be  made.  Therefore,  these  geometric  terms  and  WVA  work  hand-in-hand  to  achieve  the  technical  noise 
suppression. 


I.  Fisher  Information  for  recycled  photons 

We  can  also  consider  the  possible  benefit  to  the  Fisher  information  of  the  photon  recycling  proposal  discussed  in 
the  previous  section  [19].  Considering  photons  as  a  resource,  one  can  ask  what  the  maximum  amount  of  information 
can  be  extracted  from  a  given  set  of  photons.  In  much  the  same  way  as  a  high  finesse  cavity  can  increase  phase 
sensitivity  in  an  interferometer,  recycling  photons  from  the  bright  port  of  a  weak  value  experiment  can  increase  the 
information  about  a  parameter.  As  discussed  previously,  the  goal  is  to  postselect  all  of  the  photons  while  keeping 
the  large  weak  value  amplification.  The  scheme  is  to  recycle  the  rejected  photons  by  closing  off  the  interferometer, 
so  the  remaining  Ni  =  N  —  N1  photons  are  re-injected  ( N'  =  7 N)  and  once  again  sample  the  unknown  parameter 
k.  We  have  discussed  that  it  is  important  that  the  rejected  light  be  reshaped  to  once  again  be  in  its  original  profile, 
otherwise  all  amplification  of  the  split  detector  SNR  is  erased  over  many  cycles.  Below,  we  consider  the  simplest 
version  where  no  propagation  effects  are  included.  In  that  case,  the  second  round  of  postselected  light  has  exactly  the 
same  distribution  on  the  detector,  with  the  same  post-selection  probability  7,  so  the  Fisher  information  for  this  cycle  is 

=  (7V1)4cr4/(7(j2)  =  Aa2Ni.  This  process  is  now  repeated  many  times,  where  Ni  is  the  number  of  rejected  photons 
on  the  ith  round,  and  we  define  Nq  =  N .  The  uniform  postselection  probability  indicates  that  Nt  =  _ZVj_i(l  —  7), 
implying  that  =  N(  1  —  7)*. 

Since  each  measurement  is  independent  from  the  last,  the  Fisher  information  simply  adds,  giving  a  total  of 

OO  A  2  TV  7* 

2tot  =  4a2^V,  =  4a2Ar^(l-7r  =  - - •  (69) 

i= 0  i—0  ^ 

Therefore,  the  Fisher  information  has  been  boosted  by  a  factor  of  I/7  compared  to  the  standard  method,  or  the  single 
pass  weak  value  method.  Here,  we  are  considering  the  photon  number  N  as  the  resource.  Of  course,  as  is  pointed 
out  in  the  first  section,  one  could  have  been  sending  more  light  onto  the  detector  using  the  standard  method  in  the 
time  taken  for  the  light  recycling  or  employed  other  standard  schemes,  but  there  could  be  many  technical  reasons 
why  this  may  be  impossible  or  inadvisable  given  a  laboratory  set-up.  There  may  be  a  minimum  quiet  time  between 
laser  pulses,  for  example,  or  the  detector  may  have  a  low  light  power  threshold.  We  note  that  the  profile  reshaping 
process  actually  removes  information  about  the  parameter  k,  but  does  so  in  a  way  that  the  estimator  can  be  simply 
and  easily  implemented  as  the  split  detection  SNR.  Together  with  the  technical  advantages  already  discussed,  this  is 
an  important  improvement  over  previous  techniques.  A  further  possibility  is  to  combine  this  technique  with  quantum 
light  techniques,  which  would  give  a  powerful  advantage  for  estimating  a  parameter.  The  fact  that  all  photons  are 
now  collected  permits  one  to  further  mine  interphoton  correlations  for  noise  suppression. 


J.  Section  conclusions 

We  have  shown  how  weak  value  based  measurement  techniques  can  give  certain  technical  advantages  to  precision 
metrology.  First  and  foremost,  the  Fisher  information  in  a  weak  value  measurement  (which  uses  a  small  fraction 
of  the  available  light)  can  be  as  large  as  the  Fisher  information  of  a  standard  measurement  (which  uses  all  of  the 
available  light).  This  is  remarkable  because  the  remaining  light  can  be  sent  to  another  experiment  [11],  or  recycled 
[19]  to  give  even  higher  amounts  of  Fisher  information. 

We  have  also  explored  technical  advantages  the  weak  value  experiments  can  have  over  standard  measurement 
techniques.  Obvious  advantages,  such  as  when  the  detectors  saturate  at  a  certain  light  intensity,  have  been  pointed 
out  in  previous  works  [5,  9].  The  possible  advantages  for  different  types  of  technical  noise  should  be  investigated  on 
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FIG.  12.  A  Gaussian  laser  beam  passes  through  a  Sagnac  interferometer  consisting  of  a  50:50  beam  splitter  (BS),  a  mirror  and 
a  prism.  The  prism  weakly  perturbs  the  direction  of  the  beam  as  the  frequency  of  the  laser  source  is  modulated,  denoted  by 
the  red  and  blue  beam  paths.  We  monitor  the  position  of  the  light  entering  the  dark  port  of  the  interferometer.  We  lock  the 
input  power  to  the  interferometer  using  a  power  measurement  before  the  BS.  The  majority  of  the  light  exits  the  interferometer 
via  the  bright  port  and  is  collected  with  an  isolator  for  use  in  an  experiment. 


a  case  by  case  basis.  There  are  cases  where  the  weak  value  techniques  gives  advantages,  and  other  cases  where  it  is 
at  a  disadvantage,  and  yet  other  cases  where  there  is  no  difference.  It  is  clear,  for  example,  that  dephasing  noise  will 
reduce  the  size  of  the  weak  value,  which  will  be  detrimental  to  this  technique  [41],  We  have  shown  how  detector  noise, 
together  with  air  turbulence  can  both  be  eliminated  by  a  weak  value  deflection  measurement,  whereas  the  conventional 
standard  method  must  suffer  from  turbulence  jitter  if  detector  noise  is  suppressed  in  open  air  experiments.  We  also 
demonstrated  that  for  angular  jitter,  the  weak  value  technique  for  beam  deflection  measurements  can  have  much 
higher  Fisher  information  than  the  standard  technique.  Considering  the  wide  range  of  experiments  that  have  now 
successfully  employed  weak  value  techniques  to  make  high  precision  measurements  [4-11,  15,  16,  34,  42],  this  should 
not  be  too  surprising. 

Another  conclusion  we  reach  is  that  it  is  not  sufficient  to  show  an  estimator  does  not  reach  the  CRB  to  decide  it 
should  be  rejected.  Rather,  the  optimal  estimator  should  be  found,  and  must  be  practically  implementable.  If  it  is 
not,  the  inefficient  -  but  practical  -  estimator  is  superior.  We  have  argued  that  time-correlated  technical  noise  is  one 
example  where  the  difficulty  of  implementing  the  optimal  estimator  is  outweighed  by  the  option  of  implementing  the 
postselection  with  amplification. 


III.  WEAK  VALUE  ENHANCED  PRECISION  FREQUENCY  MEASUREMENTS 

In  the  previous  section,  we  discussed  several  different  types  of  noise  suppression  that  the  weak  value  amplification 
technique  can  benefit  from.  Here,  we  show  practical  results  of  the  weak  value  amplification  technique  for  the  purpose 
of  precision  frequency  measurements  [11]. 

Precision  frequency  measurements  [43-45]  of  a  stabilized  laser  source  are  of  great  importance  in  the  field  of  metrology 
[46]  as  well  as  atomic,  molecular  [47]  and  optical  physics  [48].  We  now  report  on  our  results,  showing  that  weak  values 
[1,  49]  in  an  optical  deflection  measurement  experiment  [5]  can  produce  frequency  shift  resolutions  down  to  129  ±  7 
kHz/-\/Hz  with  only  2  mW  of  continuous  wave  optical  power.  By  performing  a  weak  measurement  of  the  deflection 
of  an  infrared  laser  source  that  has  passed  through  a  weakly  dispersive  prism,  we  are  able  to  measure  a  change  in 
optical  frequency  comparable  to  precision  Fabry-Perot  interferometers  [50-52].  This  technique  is  relatively  simple, 
requiring  only  a  few  common  optical  components  and  operating  at  atmospheric  pressure.  Additionally,  we  show  that 
this  technique  has  low  noise  over  a  large  range  of  response  frequencies,  making  it  desirable  for  many  applications  such 
as  Doppler  anemometry  [53],  tests  of  the  isotropy  of  light  propagation  [48]  or  laser  locking  without  the  use  of  high 
finesse  Fabry-Perot  interferometers  [54]  or  atomic  lines. 

Theory. — We  describe  here  the  frequency  amplification  experiment  shown  in  Fig.  12  by  further  developing  the  ideas 
we  previously  published  in  Ref.  5.  Although  the  actual  experiment  uses  a  classical  beam,  we  choose  to  characterize 
the  weak  value  effect  one  photon  at  a  time;  this  is  valid,  owing  to  the  fact  that  we  consider  here  a  linear  system  with 
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FIG.  13.  The  position  of  the  postselected  beam  profile  is  measured  as  we  modulate  the  input  laser  frequency  of  the  inter¬ 
ferometer.  The  modulation  oscillates  as  a  sine  wave  at  10  Hz  and  the  signal  from  the  split  detector  is  frequency  filtered  and 
amplified.  The  error  bars  are  given  by  the  standard  deviation  of  the  mean.  The  minimum  frequency  change  measured  here  is 
around  743  kHz  with  an  effective  integration  time  of  30  ms.  The  weak  value  amplification  is  approximately  79. 


a  coherent  laser  beam  modeled  as  a  linear  superposition  of  Fock  states  [18]. 

In  this  experiment,  a  single-mode  Gaussian  beam  of  frequency  w  and  radius  a  passes  through  an  optical  isolator, 
resulting  in  linearly  polarized  light.  We  assume  that  the  radius  is  large  enough  to  ignore  divergence  due  to  propagation. 
Light  then  enters  a  Sagnac  interferometer  containing  a  50:50  beam  splitter  (BS),  a  mirror  and  a  prism.  The  beam 
travels  clockwise  and  counter-clockwise  through  the  interferometer,  denoted  by  the  system  states  given  by  { |  O),  1 0)}; 
we  write  the  photon  meter  states  in  the  position  basis  as  {|x)},  where  x  denotes  the  transverse,  horizontal  direction. 

Initially,  the  interferometer  (including  the  prism)  is  aligned  such  that  the  split  photon  wave  function  spatially 
overlaps  (i.e.,  the  photons  travel  the  same  path  whether  by  |  O)  or  |  O)).  After  the  interferometer  is  aligned,  the 
photons  traversing  each  path  receive  a  small,  constant  momentum  kick  in  the  vertical  direction;  this  vertical  kick  is 
controlled  by  the  interferometer  mirror  and  results  in  a  misalignment.  Due  to  its  spatial  asymmetry  about  the  input 
BS,  this  momentum  kick  creates  an  overall  phase  difference  (/>  between  the  two  paths.  By  adjusting  the  interferometer 
mirror,  we  can  control  the  amount  of  light  that  exits  the  interferometer  into  the  dark  port.  While  the  amplified  signal 
ultimately  depends  on  the  value  of  the  which- path  phase  shift,  (f>,  and  therefore  on  the  magnitude  of  the  misalignment, 
the  SNR  is  unaffected  (discussed  below). 

We  then  let  /c(w)  represent  the  small  momentum  kick  given  by  the  prism  to  the  beam  (after  alignment)  in  the 
horizontal  ^-direction.  The  system  and  meter  are  entangled  via  an  impulsive  interaction  Hamiltonian  [5]  (resulting 
in  a  new  state  \ipi)  — >  jT))  such  that  a  measurement  of  the  horizontal  position  of  the  photon  after  it  exits  the 
interferometer  gives  us  some  information  about  which  path  the  photon  took. 

We  consider  a  horizontal  deflection  that  is  significantly  smaller  than  the  spread  of  the  wave  packet  er  we  are  trying 
to  measure,  i.e.,  k(ix)a  <C  1.  In  this  approximation,  we  find  that  the  postselected  state  of  the  photons  exiting  the 
dark  port  is  given  by 

(ipfl'i')  =  (V’/lV’i)  J  dxip(x)\x)  ex.p[-ixAwk(u)\,  (70) 

where  the  weak  value,  defined  above,  is  given  by  Aw  =  —  icot((f>/2)  ss  —2 i/cj>  for  small  </>. 

There  are  two  interesting  features  of  Eq.  (70).  First,  the  probability  of  detecting  a  photon  has  been  reduced 
to  Pps  =  \{ipf\4’i)\2  =  sin2(((>/2),  and  yet  the  SNR  of  an  ensemble  of  measurements  is  nearly  quantum  limited  [9] 
despite  not  measuring  the  vast  majority  of  the  light.  Second,  the  weak  value  (which  can  be  arbitrarily  large  in 
theory)  appears  to  amplify  the  momentum  kick  k(u>)  given  by  the  prism;  the  resulting  average  position  is  given  by 
(x)w  =  2k(u})a2\Aw\  «  Ak(u)a2  /  (f),  where  the  angular  brackets  denote  an  expectation  value.  We  can  compare  this  to 
the  standard  deflection  caused  by  a  prism  measured  at  a  distance  l  which  is  given  by  (x)  «  Zfc(w)/fco,  where  ko  is  the 
wavenumber  of  the  light. 

In  order  to  predict  the  deflections  (x)  or  (a:)^,  we  must  know  the  form  of  k(u>).  For  a  prism  oriented  such  that  it 
imparts  the  minimum  deviation  on  a  beam,  the  total  angular  deviation  is  given  by  0(w)  =  2  sin-1  [n( oj)  sin(7/2)]  —  7, 
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where  n(ui)  is  the  index  of  refraction  of  the  material  and  7  is  the  angle  at  the  apex  of  the  prism  [55].  However,  we  are 
only  interested  in  the  small,  frequency-dependent  angular  deflection  S(u)  =  A 9  =  2  An{[sin(7/2)]-2  —  [?i(w)]2}_1/2, 
where  An  (A 6)  is  the  index  change  in  the  prism  (angular  deflection  of  the  beam)  for  a  given  frequency  change  of  the 
laser.  The  small  momentum  kick  is  expressed  as  k(u>)  =  6(oj)ko,  where  ko  is  the  wavenumber  of  the  incoming  light. 
We  can  then  write  the  amplified  deflection  as 


8kocr2(An/4>) 
V[sin(7/2)]-2  -  [npp ' 


(71) 


The  frequency-dependent  index  n(ui)  of  fused  silica,  which  was  used  in  this  experiment,  can  be  modeled  using  the 
Sellmeier  equation  [56].  We  can  therefore  calculate  the  expected  (x )w  using  Eq.  (71).  However,  to  compute  the 
ultimate  sensitivity  of  this  weak  value  frequency  measurement,  we  must  include  possible  noise  sources.  If  we  consider 
only  shot-noise  from  the  laser,  the  SNR  for  small  </>  is  approximated  by 


n 


fco<T(I(w), 


(72) 


as  shown  in  Ref.  9,  where  N  is  the  number  of  photons  used  in  the  interferometer.  Note  that  N  is  not  the  number  of 
photons  striking  the  detector,  which  is  given  by  NPps.  By  setting  1Z  =  1  and  using  modest  values  for  N,  a  and  u>,  we 
find  that  frequency  sensitivities  well  below  1  kHz  are  possible.  However,  other  sources  of  noise,  such  as  detector  dark 
current,  radiation  pressure  and  environmental  perturbations  will  reduce  the  sensitivity  of  the  device. 

Experiment.  In  our  experimental  setup  (shown  in  Fig.  12),  we  used  a  fiber-coupled  780  nm  external  cavity  diode 
laser  with  a  beam  radius  of  ex  =  388  /im.  The  frequency  of  the  laser  was  modulated  with  a  10  Hz  sine  wave  using  piezo 
controlled  grating  feedback.  The  frequency  control  was  calibrated  using  saturation  absorption  spectroscopy  of  the 
hyperfine  excited  states  of  the  rubidium  D2  line[57]:  F  =  3  — >  F'  =  {2  —  4  crossover,  3  —  4  crossover,  4}  transitions 
of  rubidium  85  and  the  F  =  2— >  F'  =  {1  —  3  crossover,  2  —  3  crossover,  3}  transitions  of  rubidium  87.  Linearly 
polarized  light  was  divided  before  the  interferometer  using  a  50:50  BS  (although  an  imbalanced  ratio  here  would  be 
ideal  for  practical  applications).  The  light  in  one  port  was  measured  with  a  photodiode  and  used  to  lock  the  power 
at  2  mW  with  an  acousto-optic  modulator  before  the  fiber.  The  interferometer  was  approximately  l  =  27  cm  in 
length;  the  mirror  used  to  adjust  <j>  was  approximately  6  cm  from  the  input  BS  (measured  counter  clockwise)  and  the 
prism,  made  of  fused  silica,  was  approximately  5  cm  from  the  input  BS  (measured  clockwise).  Although  the  prism 
was  not  symmetrically  placed  in  the  interferometer  as  described  in  the  theory  above,  the  results  are  the  same  aside 
from  a  global  offset  in  position  which  can  be  subtracted  off  during  processing.  The  interferometer  was  first  aligned 
to  minimize  light  in  the  dark  port  and  then,  using  the  aforementioned  mirror,  misaligned  to  allow  a  small  percentage 
of  the  light  (~2-5%)  into  the  dark  port.  The  position  of  this  light  was  measured  using  a  split  detector  (New  Focus 
model  2921).  The  signal  was  passed  through  two  6  dB/octave  bandpass  filters  centered  at  10  Hz  and  amplified  by  a 
factor  of  about  104. 

For  Fig.  13,  we  measured  the  peak  of  the  deflection  in  each  100  ms  cycle,  repeated  25  times;  we  computed  the 
average  and  the  standard  deviation  of  this  set  as  we  varied  the  change  in  the  optical  frequency.  We  find  that  the 
amplified  deflection  is  a  linear  function  of  oscillating  optical  frequency  given  by  about  720  ±  11  prn/MHz.  Compared 
to  the  unamplihed  deflection  of  about  9.1  pm/MHz  given  by  the  expression  for  (x),  this  gives  an  amplification  factor 
of  79  ±  1.2  and  a  computed  Pps  of  1.3%;  this  agrees  with  the  measured  Pps  of  2-5%  if  we  include  the  extra  light 
present  in  the  signal  due  to  phase  front  distortions  from  imperfect  optics. 

A  characteristic  noise  scan  was  taken  and  plotted  in  Fig.  14  with  and  without  frequency  modulation.  The  signal 
was  passed  directly  from  the  split  detector  into  the  oscilloscope  before  performing  a  fast  Fourier  transform.  Data 
was  taken  with  and  without  a  7.4  MHz  optical  frequency  modulation  to  show  the  noise  floor  over  a  large  bandwidth. 
The  noise  at  higher  frequencies  was  similarly  flat.  Second,  to  test  the  range  over  which  this  device  could  function,  we 
optimized  the  interferometer  at  the  low-frequency  end  of  the  laser’s  tuning  range  and  obtained  a  SNR  of  approximately 
19  with  the  7.4  MHz  optical  frequency  modulation.  We  then  tuned  to  the  high-frequency  end  of  the  laser’s  tuning 
range  (A/  «  141  GHz),  without  adjusting  or  recalibrating  the  interferometer,  and  obtained  a  SNR  of  10.  In  fact,  this 
range  can  be  much  larger  so  long  as  the  weak  value  condition  k(uj)c r  <C  1  is  satisfied;  for  our  beam  radius  and  optical 
frequency,  we  could  in  principle  measure  over  a  range  of  5  THz,  or  about  10  nm. 

For  our  experimental  parameters,  we  can  measure  below  1  MHz  of  frequency  change  with  a  SNR  around  1,  as  shown 
in  Fig.  13.  It  should  be  noted  that,  although  the  time  between  measurements  is  a  full  100  ms,  our  filtering  limits 
the  laser  noise  to  time  scales  of  about  30  ms.  For  analysis,  we  take  this  as  our  integration  time  in  estimating  N  for 
each  measurement.  The  resulting  sensitivity  for  our  apparatus  is  129  ±  7  kHz/\/Hz;  e.g.,  if  we  had  integrated  for  1  s 
instead  of  30  ms,  this  device  could  measure  a  129  kHz  shift  in  frequency  with  a  SNR  of  1.  The  error  in  frequency  comes 
from  the  calibration  described  above.  Using  Eq.  (72),  we  find  that  the  ideal  ultimate  sensitivity  is  approximately  67 
kHz/\/Hz.  This  implies  that  this  apparatus,  operating  at  atmospheric  pressure  with  modest  frequency  filtering,  is 
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FIG.  14.  We  show  the  noise  spectrum  for  a  passive  system  (green,  solid  trace)  and  for  a  driven  system  (blue,  dashed  trace), 
where  the  laser  frequency  modulation  is  7.4  MHz  at  10  Hz.  We  see  that  the  first  harmonic  of  the  signal  is  about  5  dB  down 
from  the  fundamental  and  the  third  harmonic  is  nearly  25  dB  down.  For  a  7.4  MHz  change  in  laser  frequency,  we  see  that  the 
noise  is  approximately  35  dB  below  the  signal,  demonstrating  the  low-noise  nature  of  this  measurement. 


less  than  a  factor  of  two  away  from  the  shot-noise  limit  in  sensitivity.  We  discussed  reasons  for  this  close  approach  to 
the  shot-noise  limit  in  the  previous  section. 


IV.  PRECISION  PHASE  DIFFERENCE  MEASUREMENTS 

In  the  previous  section,  we  discussed  applications  of  weak  value  amplification  to  frequency  difference  measurements. 
In  this  section,  we  do  the  same,  but  for  phase  difference  measurements,  as  reported  on  in  Ref.  [12].  The  physics  is 
somewhat  different,  because  it  reverses  the  size  ordering  of  the  parameters.  Rather  than  have  a  small  (known)  phase 
shift  amplifying  a  much  smaller  (unknown)  momentum  kick,  it  is  the  reverse.  This  leads  to  a  qualitatively  different 
different  detector  profile. 

Phase  measurements  using  coherent  light  sources  continue  to  be  of  great  interest  in  classical  optics  [58-62].  Not 
surprisingly,  many  advances  in  phase  measurement  techniques  have  been  made  since  the  introduction  of  the  laser.  For 
instance,  Caves  emphasized  how  the  SNR  of  a  phase  measurement  can  be  improved  by  using  a  squeezed  vacuum  state 
in  the  dark  input  port  of  an  interferometer  [20].  Related  advances  in  this  area  include  the  use  of  other  non-classical 
states  of  light  such  as  Fock  states  [63]  or  the  use  of  phase  estimation  techniques  [64]  which  approach  the  Heisenberg 
limit  in  phase  sensitivity  [65].  Unfortunately,  these  states  of  light  tend  to  be  weak  and  very  sensitive  to  losses,  in 
effect  reducing  the  SNR  of  a  phase  measurement.  As  a  result,  the  use  of  coherent  light  sources  has  dominated  the 
field  of  precision  metrology  [5,  66].  In  this  case,  the  phase  sensitivity  scales  as  1/vN  rad,  where  N  is  the  average 
number  of  photons  used  in  the  measurement. 

Perhaps  the  most  famous  contemporary  phase  measurement  device  is  the  laser  interferometer  gravitational-wave 
observatory  (LIGO)  [66,  67].  LIGO  is  a  power-recycled  Michelson  interferometer  with  4  km  Fabry-Perot  arms  utilizing 
approximately  ten  Watts  of  input  power.  Although  LIGO  originally  operated  in  a  heterodyne  arrangement  using 
rf-sidebands,  the  second,  “enhanced”  stage  will  make  use  of  a  homodyne  configuration  [68].  The  expected  phase 
sensitivity  of  this  device  will  be  around  10-14  rad.  However,  the  low  saturation  intensity  of  even  state-of-the-art 
detectors  limits  the  number  of  photons  one  can  use  in  a  given  measurement  with  a  fixed  bandwidth. 

We  now  show  that  it  is  possible  to  make  a  phase  measurement  with  the  same  SNR  as  balanced  homodyne  detection 
yet  only  the  light  in  the  dark  port  is  measured.  We  use  a  coherent  light  source  with  a  split-detector  in  a  Sagnac 
interferometer  and  show  that  the  signal  of  a  phase  measurement  is  amplified.  We  derive  our  results  using  a  classical 
wave  description  and  summarize  a  quantum  mechanical  treatment  which  uses  a  similar  weak  value  formalism  to  that 
presented  in  refs.  [5,  9].  Much  like  with  weak  values  [1],  there  is  a  large  attenuation  (post-selection)  of  the  electric 
field  (number  of  photons).  Thus  we  can,  in  principle,  use  a  low-cost  detector  with  a  low  saturation  intensity  and  still 
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Laser  Pol 


FIG.  15.  (Color  Online)  Experimental  setup.  A  coherent  light  source  passes  through  a  polarizer  (Pol)  producing  horizontally 
polarized  light  before  passing  through  the  first  50/50  beamsplitter  (BS).  Half  of  the  light  strikes  a  beam  block  (BB)  and  is 
thrown  out.  The  beam  then  enters  the  Sagnac  interferometer  via  the  second  BS.  One  of  the  mirrors  of  the  interferometer  is 
controlled  by  a  piezo  actuator  (PA)  to  precisely  control  the  relative  transverse  momentum  shift  k.  We  use  a  half-wave  plate 
(HWP)  and  a  piezo-actuated  Soleil-Babinet  compensator  (SBC)  to  produce  a  relative  phase  shift  between  the  two  light  paths. 
During  split-detection,  we  monitor  the  dark  port  using  another  BS  that  splits  the  light  between  a  quad-cell  detector  (QCD) 
and  a  camera  (CCD).  During  balanced  homodyne  detection,  the  interferometer  is  balanced  and  the  flip-mirror  (FM)  is  turned 
up,  sending  both  bright  and  dark  ports  into  the  balance  detector  (BD).  We  use  a  neutral  density  filter  (ND)  in  the  bright  port 
to  correct  for  the  light  lost  at  the  first  BS. 


obtain  significantly  higher  phase  sensitivity  when  compared  to  using  a  balanced  honrodyne  detector  with  the  same 
total  incident  intensity. 

Theory  —  Consider  a  coherent  light  source  with  a  Gaussian  amplitude  profile  entering  the  input  port  of  a  Sagnac 
interferometer  as  shown  in  Fig.  15.  The  interferometer  is  purposely  misaligned  using  a  piezo-actuated  mirror  such 
that  the  two  paths  experience  opposite  deflections.  The  transverse  momentum  shift  imparted  by  the  mirror  is  labeled 
as  k.  A  relative  phase  shift  0  can  be  induced  between  the  two  light  paths  (clockwise  and  counterclockwise)  in  the 
interferometer. 

We  model  the  electric  field  propagation  using  standard  matrix  methods  in  the  paraxial  approximation.  We  can 
then  write  the  input  electric  field  amplitude  as 

Ein=  (E0e-*2/^2  o)T,  (73) 

where  a  is  defined  as  the  Gaussian  beam  radius.  The  first  position  in  the  column  vector  denotes  port  1  (see  Fig.  15) 
of  the  beam  splitter  and  the  second  position  denotes  port  2  (with  no  input  electric  field).  We  assume  that  the  beam 
is  large  enough  so  that  the  entire  Rayleigh  range  fits  within  the  interferometer.  The  matrix  representation  for  the 
50/50  beamsplitter  is  given  by 


(74) 


where  column  and  row  one  correspond  to  port  4  (counterclockwise)  and  column  and  row  two  correspond  to  port  3 
(clockwise).  We  now  define  a  matrix  that  gives  both  an  opposite  momentum  shift  k  and  a  relative  phase  shift  0 
between  the  two  light  paths 


M  = 


0 


0 

0—i(  —  kx+cl>/2) 


(75) 


The  exiting  electric  field  amplitude  is  represented  by  the  matrix  combination  ( BMB)Ein , 


E 


iEoe~x2/4a 


-  sin(fca:  —  0/2) 
cos  (for  —  0/2) 


(76) 
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Transverse  Position  (a.u.) 

FIG.  16.  (Color  Online)  Post-selected  intensity  distribution.  The  dotted  (blue)  curve  is  the  single-mode  input  profile  of  the 
beam  in  the  interferometer.  The  solid  (black)  curve  is  the  post-selected  split- Gaussian  mode  produced  by  the  misalignment  of 
the  interferometer  and  a  slight  relative  phase  shift  between  the  two  light  paths.  The  solid  lines  represent  the  tilted  wave  fronts 
in  the  two  paths  of  the  interferometer  when  they  combine  at  the  BS,  one  that  is  delayed  relative  to  the  other,  producing  an 
asymmetric  split-Gaussian  (shown).  The  dashed  line  represents  a  wave  front  with  the  same  tilt  but  zero  relative  phase  delay, 
which  would  result  in  a  symmetric  split-Gaussian  (not  shown). 


where  the  first  position  now  corresponds  to  port  2  (the  dark  port)  and  the  second  position  corresponds  to  port  1  (the 
bright  port). 

For  a  balanced  homodyne  detection  scheme,  we  take  k  =  0  and  <f>  — >  7t/2  +  <j>  and  subtract  the  integrated  intensity 
of  both  ports.  After  normalizing  by  the  total  power,  we  obtain  the  unitless  homodyne  signal 

Ah  =  sin(</>).  (77) 


Thus,  we  see  that  by  balancing  the  interferometer,  we  are  measuring  the  signal  along  the  linear  part  of  the  sine  curve 
for  small  phase  shifts. 

In  contrast,  if  we  consider  a  small  transverse  momentum  shift  (fecr  <  1)  and  monitor  only  the  dark  port,  given  by 
the  first  element  in  the  Eout  vector,  we  find  that 


E. 


(„>  „A(x_  tang/2)\  exp[-a:2/4cT2], 


where  A  =  —  iE^ka  cos(0/2).  The  intensity  at  the  dark  port  is  then  given  by 


Id{x)  =  PpsI0  ^  -  tai^//2^  ^  exp[— rr2/2er2], 


(78) 


(79) 


where  Pps  is  the  attenuation  (post-selection  probability)  of  the  measured  output  beam  given  by 

Pps  =  |fccrcos(</>/2)|2,  (80) 

and  Iq  is  the  maximum  input  intensity  density.  Aside  from  the  attenuation  factor  Ppsi  Id{x )  is  normalized  for 
vanishingly  small  <j).  Equation  (79)  is  plotted  in  Fig.  16. 

In  order  to  have  the  greatest  accessibility  for  our  results,  we  have  presented  the  theory  here  in  terms  of  classical 
wave  optics.  However,  for  an  analysis  at  the  single  photon  level,  and  an  analysis  of  the  SNR  for  precision  phase 
measurement,  it  is  of  interest  to  summarize  the  quantum  mechanical  derivation.  The  setup  may  be  interpreted  as 
the  detection  of  the  which-path  (system)  information  of  a  single  photon  (clockwise  or  counter-clockwise).  This  is 
done  indirectly  using  the  transverse  position  degree  of  freedom  of  the  photon  as  the  meter,  which  is  followed  by  a 
post-selection  of  the  system  state  (due  to  the  interference  at  the  beam  splitter) ,  allowing  only  a  few  photons  to  arrive 
at  the  split-detector  where  the  meter  is  measured. 

If  the  pre-  and  post-selected  system  states  are  almost  orthogonal  (so  kcr  -C  (f>  1),  then  there  is  an  anomalously 

large  shift  of  the  beam’s  position,  described  by  Aw  ss  —2 i/<j>  of  the  system  (see  Ref.  [5]),  as  mentioned  in  the 
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previous  section.  The  small  overlap  of  the  system’s  states  gives  rise  to  an  amplification  of  the  small  momentum  shift 
imparted  by  the  mirror.  However,  in  the  present  case,  </>  <C  fccr  <C  1,  so  the  situation  does  not  have  a  straightforward 
interpretation  in  terms  of  weak  values.  Nevertheless,  there  are  certain  features  in  common:  the  amplification  effect 
can  be  traced  back  to  the  fact  that  there  is  a  renormalization  of  the  state,  owing  to  a  small  post-selection  probability 
Pps.  A  quantitative  analysis  shows  that  the  meter  deflection  is  now  proportional  to  the  inverse  weak  value  A”1, 

(x)  =  —  2ImA“1  ss  —  (j)/k,  (81) 

which  may  be  interpreted  as  an  amplification  of  the  small  phase  shift  by  the  mirror’s  momentum  shift  k.  Notice  that 
in  contrast  to  the  usual  weak  value  case,  the  post-selected  distribution  is  not  a  simple  shift  of  the  meter  wavefunction, 
but  in  fact  corresponds  to  a  two-lobe  structure  [Eq.  (79)]  as  seen  in  Fig.  16. 

Instead  of  measuring  the  average  position,  one  can  use  a  split-detection  method  by  integrating  the  intensity  over 
the  right  side  of  the  origin  and  subtracting  from  that  the  integrated  intensity  over  the  left  side  of  the  origin.  This 
detection  method  is  well  suited  to  the  split-Gaussian  beam  and  results  in  a  split-detection  signal  which,  if  normalized 
by  the  total  power  striking  the  detector,  is  proportional  to  the  average  position.  This  quantity  is  given  by 


As 


(82) 


Despite  the  large  amplification  of  the  average  position  measurement  of  the  post-selected  photons,  the  SNR  is 
essentially  the  same  for  a  balanced  homodyne  measurement  of  phase.  The  SNR  of  a  phase  measurement  using 
balanced  homodyne  or  split-detection  can  be  expressed  as  lZh,s  =  A h,s  where  Nd  is  the  number  of  photons 

striking  the  detector.  These  expressions  are  identical,  except  for  an  overall  constant  factor  of  2/7T .  This  reduces  the 
SNR  of  the  split-detection  method  by  approximately  20%.  It  is  also  interesting  to  note  that  the  SNR  is  independent 
of  k.  Thus,  we  can  in  principle  reduce  k  (and  Pps)  arbitrarily,  allowing  us  to  increase  the  input  power  and  therefore 
N,  ultimately  improving  the  measurement  sensitivity  arbitrarily  while  using  the  same  detector. 

Experiment  —  In  the  present  experiment  (see  Fig.  15),  the  coherent  light  beam  was  created  using  an  external  cavity 
diode  laser  tuned  near  the  D\  line  of  rubidium,  approximately  795  nm.  The  beam  was  coupled  into  single  mode  fiber 
and  then  launched  to  produce  a  single  mode  Gaussian  profile.  The  light  was  collimated  with  a  radius  of  cr  «  775  pm 
and  the  continuous  wave  power  ranged  from  0.5  mW  up  to  1  mW.  The  Sagnac,  composed  of  a  50/50  beam  splitter 
and  three  mirrors,  was  rectangular.  We  used  two  configurations  for  the  geometry  of  the  interferometer,  one  with 
dimensions  39  cm  x  8  cm  (large)  and  another  with  dimensions  11  cm  x  8  cm  (small).  The  beam  profile  and  position 
of  the  post-selected  photons  were  measured  using  a  quad-cell  detector  (QCD,  New  Focus  model  2921)  functioning  as 
a  split-detector  and  a  CCD  camera  (Newport  model  LBP-2-USB).  During  balanced  homodyne  detection,  the  signal 
was  measured  using  a  Nirvana  balance  detector  (BD,  New  Focus  model  2007).  The  quantum  efficiency  of  the  BD  was 
about  81%,  whereas  the  quantum  efficiency  of  the  QCD  was  75%.  The  QCD  was  also  equipped  with  a  protective  glass 
plate  with  only  50%  transmissivity.  The  outputs  from  the  QCD  and  the  BD  were  fed  into  two  low-noise  preamplifiers 
with  frequency  filters  (Stanford  Research  Systems  model  SR560)  in  series. 

We  used  a  half-wave  plate  (HWP)  with  a  piezo-actuated  Soleil-Babinet  Compensator  (SBC)  inside  the  Sagnac 
interferometer  to  induce  a  relative  phase  shift.  The  HWP  was  oriented  such  that  the  horizontally  polarized  input 
light  was  rotated  to  vertically  polarized  light.  The  SBC  was  oriented  such  that  the  fast  axis  was  vertical  and  the 
slow  axis  was  horizontal.  The  two  light  paths  in  the  interferometer  encountered  these  optical  elements  in  opposite 
order,  allowing  for  a  known,  tunable  phase  difference  between  them.  The  piezo-actuator  in  the  SBC,  which  moved 
approximately  100  pm/mV,  imparted  a  relative  phase  shift  of  22  ±  0.9/irad/V. 

Using  the  large  configuration,  with  0.5  mW  of  input  power,  the  piezo  actuator  in  the  SBC  was  driven  with  a  20  V 
peak  to  peak  sine  wave  at  634  Hz,  corresponding  to  a  relative  phase  shift  of  440  /irad.  The  normalized  split-detection 
signal  As  (factoring  in  an  offset  from  junk-light  hitting  the  detector)  was  measured  while  the  transverse  momentum 
shift  k  was  varied  using  the  piezo-actuated  mirror.  After  scaling  As  by  the  appropriate  factor  given  in  Eq.  (82),  the 
results  were  plotted  in  Fig.  17.  The  theory  line,  which  corresponds  to  a  relative  phase  shift  of  440  /irad,  is  drawn 
along  with  the  data.  We  see  good  agreement  of  the  data  with  theory,  with  a  clear  inverse  dependence  of  ( x )  on  k. 
However,  it  should  be  noted  that  an  determination  of  k  for  this  fit  requires  calibration,  which  in  practice  is  quite 
simple. 

We  then  compared  this  split-detection  method  of  phase  measurement  to  a  balanced  homodyne  measurement.  We 
used  the  small  configuration  with  625  pW  of  (effective)  continuous  wave  input  power  -taking  into  account  various 
attenuators — and  varied  the  driving  voltage  to  the  piezo-actuator  in  the  SBC.  The  low-pass  filter  limits  the  laser  noise 
to  the  10%  to  90%  rise-time  of  a  1  kHz  sine  wave  (300/is).  We  take  this  limit  as  our  integration  time  to  determine  the 
number  of  795  nm  photons  used  in  each  measurement.  We  measured  the  SNR  of  a  phase  measurement  (see  Fig.  18) 
using  the  same  method  as  ref.  [9]  and  found  that  the  SNR  of  our  homodyne  measurement  was  on  average  3.2  below 
an  ideal  quantum  limited  system.  The  SNR  of  our  split-detection  method  was  on  average  2.6  times  below  an  ideal 
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FIG.  17.  Dependence  on  transverse  momentum.  The  transverse  momentum  shift  imparted  by  the  piezo-actuated  mirror  was 
varied  and  the  split-detection  signal  was  measured  using  the  QCD.  The  solid  line  is  the  theory  curve  from  Eq.  (81)  using  the 
expected  phase  shift. 


FIG.  18.  (Color  Online)  Experimental  comparison  of  split-detection  to  balanced  liomodyne.  We  vary  the  driving  voltage 
applied  to  the  piezo-actuator  and  measure  the  SNR  using  balanced  homodyne  detection  (orange  triangles)  and  the  split- 
detection  method  (blue  squares).  The  input  power  to  the  interferometer  is  approximately  the  same  for  both  methods.  Linear 
fits  to  the  data  (solid  lines)  show  that  these  two  methods  have  essentially  the  same  sensitivity.  The  ideal  quantum  limited  SNR 
(factoring  in  the  quantum  efficiency  of  each  detector)  is  plotted  using  a  dashed  blue  line  (split-detection)  or  a  dashed  orange 
line  (balanced  homodyne  detection).  The  dashed  black  line  illustrates  the  theoretical  (\/7V)  improvement  of  the  split-detection 
method  assuming  that  an  equal  number  of  photons  are  incident  on  both  the  split-detector  and  the  balance  detector. 


quantum  limited  system.  We  take  into  account  the  quantum  efficiency  of  each  detector  for  these  two  values,  yet  we 
ignore  any  contribution  of  dark  current  to  the  expected  SNR. 

Importantly,  the  SNR  resulting  from  both  measurement  techniques  is  approximately  the  same.  However,  the  split- 
detection  method  for  this  data  had  only  about  15%  of  the  input  light  incident  on  the  detector.  Thus,  for  diodes 
with  the  same  saturation  intensity,  it  is  possible  to  use  almost  seven  times  more  input  power  with  this  configuration, 
resulting  in  a  SNR  about  2.6  times  higher  (the  black,  dashed  line  in  Fig.  18).  The  improvement  of  the  SNR  by 
taking  advantage  of  the  attenuation  before  the  detector  has  no  theoretical  limit  and  is  only  limited  in  practice  by 
phase  front  distortions  and  back-reflections  off  of  optical  surfaces  which  degrade  the  fidelity  of  the  interference.  Using 
commercially  available  equipment  and  one  day  of  integration  time,  sub-picoradian  sensitivity  is  possible  even  with  a 
low-saturation  intensity  split-detector. 
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Part  II 

Further  results  concerning  weak  values  and  weak 
measurements 


In  the  course  of  this  investigation,  we  discovered  and  developed  a  number  of  theoretical  results  concerning  generalized 
measurements  and  weak  values  that  we  will  now  summarize  briefly  so  as  to  keep  the  report  from  getting  too  lengthy. 


A.  Contextual  Values 


There  has  been  a  conceptual  gap  between  the  notion  of  an  observable  operator  on  one  hand,  and  the  notion  of  how 
to  measure  such  an  observable  when  the  measurement  is  not  projective.  We  introduced  contextual  values  [49]  as  a 
generalization  of  the  eigenvalues  of  an  observable  that  takes  into  account  both  the  system  observable  and  a  general 
measurement  procedure.  This  technique  leads  to  a  natural  definition  of  a  general  conditioned  average  that  converges 
uniquely  to  the  quantum  weak  value  in  the  minimal  disturbance  limit.  This  approach  gives  a  theoretical  grounding 
of  the  quantum  weak  value  by  providing  a  more  general  framework.  We  give  several  examples  of  how  that  framework 
relates  to  existing  experimental  and  theoretical  results.  The  theory  of  contextual  values  extends  the  well-established 
theory  of  generalized  state  measurements  by  bridging  the  gap  between  partial  state  collapse  and  the  observables  that 
represent  physically  relevant  information  about  the  system.  We  also  wrote  a  lengthy  extension  of  these  theoretical 
results  [69]  to  connect  them  more  closely  with  both  the  quantum  operations  literature  and  other  known  results  in  pre 
and  post-selected  systems. 


B.  Two-Party  Leggett-Garg  Inequalities  with  Semiweak  Measurements 

In  joint  theory/experimental  work  of  the  Jordan  and  Howell  group  [70],  we  generalize  the  derivation  of  Leggett-Garg 
inequalities  to  systematically  treat  a  larger  class  of  experimental  situations  by  allowing  multi-particle  correlations, 
invasive  detection,  and  ambiguous  detector  results.  Furthermore,  we  show  how  many  such  inequalities  may  be  tested 
simultaneously  with  a  single  setup.  As  a  proof  of  principle,  we  violate  several  such  two-particle  inequalities  with  data 
obtained  from  a  polarization-entangled  biphoton  state  and  a  semi-weak  polarization  measurement  based  on  Fresnel 
reflection.  We  also  point  out  a  non-trivial  connection  between  specific  two-party  Leggett-Garg  inequality  violations 
and  convex  sums  of  strange  weak  values  (values  outside  of  the  eigenvalue  range). 


C. 


Measuring  Which-Path  Information  with  Coupled  Electronic  Mach-Zehnder  Interferometers 


We  now  summarize  work  done  on  electron  Mach-Zehnder  interferometers  [71],  and  how  one  interferometer  can 
be  used  as  a  detector  for  which-path  information  in  a  coupled  interferometer.  Once  the  detector  is  calibrated,  we 
calculate  weak  values  that  arise  when  one  detector  current  is  conditioned  on  the  outcome  of  the  neighboring  current. 
The  use  of  contextual  values  mentioned  above  enables  the  precise  construction  of  which-path  operator  averages  that 
are  valid  for  any  measurement  strength  from  the  available  drain  currents.  The  form  of  the  contextual  values  provides 
direct  physical  insight  about  the  measurement  being  performed,  providing  information  about  the  correlation  strength 
between  system  and  detector,  the  measurement  inefficiency,  and  the  proper  background  removal.  We  find  that  the 
detector  interferometer  must  display  maximal  wave-like  behavior  to  optimally  measure  the  particle-like  which-path 
information  in  the  system  interferometer,  demonstrating  wave-particle  complementarity  between  the  system  and 
detector.  We  also  find  that  the  degree  of  quantum  erasure  that  can  be  achieved  by  conditioning  on  a  specific  detector 
drain  is  directly  related  to  the  ambiguity  of  the  measurement.  Finally,  conditioning  the  which-path  averages  on  a 
particular  system  drain  using  the  zero  frequency  cross-correlations  produces  conditioned  averages  that  can  become 
anomalously  large  due  to  quantum  interference;  the  weak  coupling  limit  of  these  conditioned  averages  can  produce 
both  weak  values  and  detector-dependent  semi-weak  values.  This  system  has  recently  been  experimentally  realized 
[72],  and  the  experiment  demonstrates  the  quantum  eraser  effect. 
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D.  The  significance  of  the  imaginary  part  of  the  weak  value 

An  important  element  of  the  weak  value  amplification  experiments  is  the  use  of  the  imaginary  weak  value.  However, 
a  central  question  is  whether  or  not  this  can  be  interpreted  as  a  conditioned  average  of  that  observable  in  the  limit 
of  zero  measurement  disturbance.  Unlike  the  real  weak  value,  which  can  be  understood  as  a  conditioned  average, 
we  show  [73]  that  the  imaginary  part  of  the  generalized  weak  value  does  not  provide  information  pertaining  to  the 
observable  being  measured.  What  it  does  provide  is  direct  information  about  how  the  initial  state  would  be  unitarily 
disturbed  by  the  observable  operator.  Specifically,  we  provide  an  operational  interpretation  for  the  imaginary  part  of 
the  generalized  weak  value  as  the  logarithmic  directional  derivative  of  the  post-selection  probability  along  the  unitary 
flow  generated  by  the  action  of  the  observable  operator.  To  obtain  this  interpretation,  we  revisit  the  standard  von 
Neumann  measurement  protocol  for  obtaining  the  real  and  imaginary  parts  of  the  weak  value  and  solve  it  exactly  for 
arbitrary  initial  states  and  post-selections  using  the  quantum  operations  formalism,  which  allows  us  to  understand 
in  detail  how  each  part  of  the  generalized  weak  value  arises  in  the  linear  response  regime.  We  also  provide  exact 
treatments  of  qubit  measurements  and  Gaussian  detectors  as  illustrative  special  cases,  and  show  that  the  measurement 
disturbance  from  a  Gaussian  detector  is  purely  decohering  in  the  Lindblad  sense,  which  allows  the  shifts  for  a  Gaussian 
detector  to  be  completely  understood  for  any  coupling  strength  in  terms  of  a  single  complex  weak  value  that  involves 
the  decohered  initial  state. 


E.  Weak  values  are  universal  in  von  Neumann  measurements 

The  theoretical  phenomena  of  weak  values  only  exists  in  the  limit  where  the  interaction  strength  goes  to  zero. 
However,  all  real  experiments  work  at  a  finite  interaction  strength.  It  is  therefore  natural  to  ask  if  there  is  any  general 
result  that  holds  at  finite  measurement  strength.  We  find  that  for  the  commonly  used  “von  Neumann”  interaction 
model,  the  answer  is  yes  [74].  To  do  so,  we  refute  the  widely  held  belief  that  the  quantum  weak  value  necessarily 
pertains  to  weak  measurements.  To  accomplish  this,  we  use  the  transverse  position  of  a  free  particle  as  the  detector 
for  the  conditioned  von  Neumann  measurement  of  a  system  observable.  For  any  coupling  strength,  any  initial  states, 
and  any  choice  of  conditioning,  the  averages  of  the  detector  position  and  momentum  are  completely  described  by  the 
real  parts  of  three  generalized  weak  values  in  the  joint  Hilbert  space.  Higher-order  detector  moments  also  have  similar 
weak  value  expansions.  Using  the  Wigner  distribution  of  the  initial  detector  state,  we  find  compact  expressions  for 
these  weak  values  within  the  reduced  system  Hilbert  space.  As  an  application  of  the  approach,  we  show  that  for  any 
Hermite-Gauss  mode  of  a  paraxial  beam-like  detector  these  expressions  reduce  to  the  real  and  imaginary  parts  of  a 
single  system  weak  value  plus  an  additional  weak- value-like  contribution  that  only  affects  the  momentum  shift. 


F.  Quantum  instruments  as  a  foundation  for  both  states  and  observables 

This  more  theoretical  approach  works  directly  from  the  measured  results  to  make  predictions  or  retrodictions  [75]. 
We  demonstrate  that  quantum  instruments  can  provide  a  unified  operational  foundation  for  quantum  theory.  Since 
these  instruments  directly  correspond  to  laboratory  devices,  this  foundation  provides  an  alternate,  more  experimentally 
grounded,  perspective  from  which  to  understand  the  elements  of  the  traditional  approach.  We  first  show  that  in 
principle  all  measurable  probabilities  and  correlations  can  be  expressed  entirely  in  terms  of  quantum  instruments 
without  the  need  for  conventional  quantum  states  or  observables.  We  then  show  how  these  states  and  observables 
reappear  as  derived  quantities  by  conditioning  joint  detection  probabilities  on  the  first  or  last  measurement  in  a 
sequence  as  a  preparation  or  a  post-selection.  Both  predictive  and  retrodictive  versions  of  states  and  observables 
appear  in  this  manner,  as  well  as  more  exotic  bidirectional  and  interdictive  states  and  observables  that  cannot  be 
easily  expressed  using  the  traditional  approach.  We  also  revisit  the  conceptual  meaning  of  the  Heisenberg  and 
Schrodinger  pictures  of  time  evolution  as  applied  to  the  various  derived  quantities,  illustrate  how  detector  loss  can 
be  included  naturally,  and  discuss  how  the  instrumental  approach  fully  generalizes  the  time-symmetric  two-vector 
approach  of  Aharonov  et  al.  to  any  realistic  laboratory  situation. 


V.  CONCLUSIONS 

In  this  report  we  have  described  numerous  projects  worked  on  by  the  Jordan  and  Howell  groups  based  on  the  theme 
of  weak  measurements,  weak  values  and  metrology  in  years  2010-2013. 

We  discussed  a  proposed  weak  value  recycling  scheme,  where  we  have  shown  how  a  single  optical  pulse  can  be 
trapped  inside  the  interferometer  using  a  Pockels  cell  and  the  polarization  of  the  pulse.  Once  trapped,  all  the  photons 
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can  eventually  exit  the  dark  port  while  retaining  an  amplified  position,  greatly  boosting  the  sensitivity  of  the  precision 
measurement.  Pulse  reshaping  is  important  for  the  effect  to  not  degrade  over  time.  The  added  power  accumulated  at 
the  detector  within  a  fixed  duration  of  time  is  the  dominant  source  of  sensitivity  gain.  Further  increases  are  achievable 
by  trapping  multiple  pulses  in  the  interferometer  simultaneously. 

Next,  we  discussed  the  precise  nature  of  the  technical  advantages  of  weak  value  amplification  using  the  Fisher 
information  measure  in  the  presence  of  technical  noise  sources.  We  demonstrated  that  the  weak  value  technique 
concentrates  all  the  Fisher  information  about  the  parameter  into  just  the  post-selected  fraction.  When  technical 
noise  is  added,  there  can  be  situations  where  the  weak  value  technique  has  greater  Fisher  information  than  the  direct 
technique.  This  is  because  while  the  fundamental  limit  is  the  same  for  the  two  methods,  they  are  reached  in  different 
ways.  Consequently,  one  way  can  be  affected  by  the  noise  much  more  than  the  other.  For  example,  we  showed  how 
the  deflection  measurements  can  strongly  suppress  angular  jitter.  We  also  showed  that  just  because  the  direct  method 
has  higher  Fisher  information,  does  not  mean  that  it  is  necessarily  superior.  The  estimator  which  needs  to  be  used 
to  reveal  the  higher  information  may  be  technically  difficult  to  implement,  which  would  then  hinder  any  advantage. 

We  then  discussed  two  experiments  using  modified  weak  value-like  techniques  to  precisely  measure  changes  in 
frequency  and  phase.  With  only  2  rnW  of  continuous  wave  input  power,  we  have  measured  a  frequency  shift  of 
129  ±  7  kHz/\/Hz;  we  have  shown  that  the  system  is  stable  over  our  maximum  tuning  range  of  140  GHz  without 
recalibration  and  is  nearly  shot-noise-limited.  We  described  also  our  phase  amplification  technique.  The  relative 
phase  is  monitored  between  two  paths  of  a  slightly  misaligned  interferometer  by  measuring  the  average  position  of 
a  split-Gaussian  mode  in  the  dark  port.  Although  we  monitor  only  the  dark  port,  we  show  that  the  signal  varies 
linearly  with  phase  and  that  we  can  obtain  similar  sensitivity  to  balanced  homodyne  detection.  We  derive  the  source 
of  the  amplification  both  with  classical  wave  optics  and  as  an  inverse  weak  value. 

We  then  briefly  reviewed  some  of  the  other  topics  we  explored  that  were  indirectly  related  to  our  main  research 
goals.  These  included  the  contextual  value  formalism,  which  gives  a  precise  prescription  for  how  to  connect  the  general 
output  of  a  detector  to  the  behavior  of  a  coupled  system  one  is  interested  in.  We  expect  this  approach  to  have  great 
utility  in  a  wide  variety  of  experimental  contexts.  We  also  explored  the  Leggett-Garg  test  of  quantum  mechanics 
with  weak  measurements,  which  was  a  joint  theory  and  experiment  project.  The  contextual  value  approach  was  quite 
useful  here,  as  well  as  in  the  coupled  electronic  Mach-Zehnder  system.  There,  we  showed  how  one  detector’s  fringe 
information  was  an  indirect  probe  for  the  which-path  information  of  the  other  interferometer.  It  was  also  possible  to 
implement  post-selection,  which  may  be  very  useful  for  future  amplification  experiments  in  the  solid  state.  We  also 
discussed  some  discoveries  about  the  nature  of  weak  values:  we  gave  a  precise  interpretation  to  the  imaginary  part  of 
the  weak  value,  and  also  showed  a  universal  form  of  the  conditioned  average  away  from  the  weak  limit,  so  long  as  the 
von  Neumann  type  coupling  was  maintained.  The  quantum  instrument  approach  to  systems  with  post-selection  was 
also  further  developed. 

In  summary,  we  accomplished  the  main  goals  set  out  in  the  original  proposal,  and  also  were  able  to  investigate  a 
number  of  other  related  issues  that  arose  over  the  course  of  the  investigation.  Although  we  found  the  weak  value 
method  does  not  alter  the  scaling  of  the  SNR  with  respect  to  the  number  of  photons,  it  still  has  tremendous  practical 
advantages  for  many  kinds  of  parameter  estimation  tasks  as  we  have  laid  out. 
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